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ABSTRACT 

Context. Gaia is Europe’s space astrometry mission, aiming to make a three-dimensional map of 1,000 million stars in our Milky 
Way to unravel its kinematical, dynamical, and chemical structure and evolution. 

Aims. We present a study of Gaia’s detection capability of objects, in particular non-saturated stars, double stars, unresolved external 
galaxies, and asteroids. Gaia’s on-board detection software autonomously discriminates stars from spurious objects like cosmic rays 
and Solar protons. For this, parametrised point-spread-function-shape criteria are used, which need to be calibrated and tuned. This 
study aims to provide an optimum set of parameters for these filters. 

Methods. We developed a validated emulation of the on-board detection software, which has 20 free, so-called rejection parameters 
which govern the boundaries between stars on the one hand and sharp (high-frequency) or extended (low-frequency) events on the 
other hand. We evaluate the detection and rejection performance of the algorithm using catalogues of simulated single stars, resolved 
and unresolved double stars, cosmic rays, Solar protons, unresolved external galaxies, and asteroids. 

Results. We optimised the rejection parameters, improving - with respect to the functional baseline - the detection performance of 
single stars and of unresolved and resolved double stars, while, at the same time, improving the rejection performance of cosmic 
rays and of Solar protons. The optimised rejection parameters also remove the artefact of the functional-baseline parameters that 
the reduction of the detection probability of stars as function of magnitude already sets in before the nominal faint-end threshold 
at G = 20 mag. We find, as a result of the rectangular pixel size, that the minimum separation to resolve a close, equal-brightness 
double star is 0.23 arcsec in the along-scan and 0.70 arcsec in the across-scan direction, independent of the brightness of the primary. 
To resolve double stars with AG > 0 mag, larger separations are required. We find that, whereas the optimised rejection parameters 
have no significant impact on the detectability of pure de Vaucouleurs profiles, they do significantly improve the detection of pure 
exponential-disk profiles, and hence also the detection of unresolved external galaxies with intermediate profiles. We also find that the 
optimised rejection parameters provide detection gains for asteroids fainter than 20 mag and for fast-moving near-Earth objects fainter 
than 18 mag, albeit this gain comes at the expense of a modest detection-probability loss for bright, fast-moving near-Earth objects. 
The major side effect of the optimised parameters is that spurious ghosts in the wings of bright stars essentially pass unfiltered. 

Key words. Space vehicles: instruments; Stars: general; Stars: binaries: general; Galaxies: general; Cosmic rays; Minor planets, 
asteroids: general 


1. Introduction 


Gaia (e.g., |Perryman et al.| |2001[ |Lindegren et al.| |2008| ) is 


the current astrometry mission of the European Space Agency 
(ESA), following up on the success of the Hipparcos mission 
( |ESA| |1997[ |Perryman et al.| |1997| | Perryman] |2009| ). Gaia’s 


objective is to unravel the kinematical, dynamical, and chemi¬ 
cal structure and evolution of our Galaxy, the Milky Way (e.g., 
|Gomez et al.[|2010] ). In addition, Gaia’s data will revolutionise 
many other areas of astronomy, e.g., stellar structure and evolu¬ 
tion, stellar variability, double and multiple stars, Solar-system 
bodies, extra-galactic objects, fundamental physics, and exo- 


planets (e.g., Pourbaix, 

2008, Tanga et al., 2012^Mignard & 

Klioner, 2010 ;|Eyer et a 

.| 2011; Sozzetti| 2011; Mouret[ 2011; 

Tsalmantza et al. | 2009; 

Krone-Martins et al., 2013). During its 


five-year lifetime, Gaia will survey the full sky and repeatedly 
observe the brightest 1,000 million objects, down to 20 th magni¬ 
tude (e.g., |de Bruijne et aL]|2010| ). Gaia’s science data comprises 
absolute astrometry, broad-band photometry, and low-resolution 


spectro-photometry. Medium-resolution spectroscopic data will 
be obtained for the brightest 150 million sources, down to 17 th 
magnitude. The final Gaia catalogue, due in 2022, will con¬ 
tain astrometry (positions, parallaxes, and proper motions) with 
standard errors less than 10 micro-arc second (//as, //as yr -1 for 
proper motions) for stars brighter than 12 th magnitude, 25 //as for 
stars a t 15 th magnitude, and 300 //as at magnitu de 20 (de Bruijnel 
2012 ). Milli-magnitude-precisionphotometry ( [Jordi et al.||2010| i 


allows to get a handle on effective temperature, surface gravity, 
metallicity, and reddening of all stars (Bailer-Jones 2010| 


Liu 


et al.| [2012] ). The spectroscopic data may allow the determina¬ 
tion of radial velocities with error s of 1 km s~ l at th e brig ht end 
and 15 km s _1 at magnitude 17 (Wilkinson et al. 


2005 


Katz 


et al. 2011) as well as astrophysical diagnostics such as effective 


temperature and metallicity for the brightest few million objects 
( Kordop atis et al.||~2011| ). Clearly, these performances will only 
be reached with a total of five years of collected data and after 
careful calibration and extensive data processing. 
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Gaia is a survey mission and the spacecraft continuously 
scans the sky. The inertial rotation rate is 60 arcsec per second - 
which means the rotation period is 6 hours - and a slow preces¬ 
sion of the spin axis at a fixed, 45-degree angle to the Sun allows 
to reach full-sky coverage after some 6 months. On average, stars 
are seen about 70 times during the five-year mission. The slow 
rotation of the spacecraft causes stars to drift through the fo¬ 
cal plane. The CCD detectors in the focal plane are hence oper¬ 
ated in Time-Delayed Integration (TDI) mode, which means that 
the charges are clocked in the scanning direction - also called 
along-scan (AL) direction, as opposed to the orthogonal direc¬ 
tion, which is referred to as the across-scan (AC) direction - at 
the same speed as the optical image moves over the CCD sur¬ 
face. The object images thus gradually build up in intensity be¬ 
fore reaching the read-out register of each CCD. The precession 
of the spin axis causes a small, time-variable across-scan motion 
of the optical image on the CCD, up to 4 across-scan pixels over 
a 4.42-second CCD transit. 


The Gaia focal-plane assembly (e.g., [Kohley et al.| [2012] ), 
with 106 CCD detectors, has five dedicated functions: 4 
CCDs for metrology, i.e., ba s ic-ang le monitoring and wave- 
front sensing ( [Gielesen et al.| |2012 |Mora & Vosteen| |2012| ), 
14 Sky Mapper (SM) CCDs for object detection and re¬ 
jection of prompt-particle events, 62 Astrometric Field (AF) 
CCDs, 14 Blue-Photometer/Red-Photometer (BP/RP) CCDs 
for low-resolution spectro-photometry, and 12 Radial-Velocity- 
Spectrograph (RVS) CCDs for radial velocities and medium- 
resolution spectra. The AF, BP/RP, and RVS CCDs see the su¬ 
perimposed light coming from the two telescopes, which look at 
the sky separated by a basic angle of 106.5 deg along the scan 
direction. The SM CCDs, in contrast, either see the light from 
one telescope or the light from the other telescope. The CCDs 
are distributed over seven independent rows; a star transiting the 
focal plane sees the following CCDs in time order: either SMI 
or SM2, AF1.. .AF9, BP, RP, and RVS1.. .RVS3; RVS is only 
present for four of the seven rows. Two particular aspects of 
Gaia’s design worth recalling here are its rectangular aperture 
ratio (1.45 x 0.50 m 1 2 , i.e., 3 : 1) and its rectangular pixel size 
(10 x 30 yum 2 , i.e., 1 : 3). This configuration allows the along- 
and across-scan images - at least of point sources - to roughly 
have the same size expressed in units of pixels. 

Contrary to the Hipparcos mission, which selected its targets 
for observationbased on apre-defined input catalogue loaded on 
board ( [Turon et aL}|1992| , Gaia will perform an unbiased survey 
of the sky. Since an all-sky input catalogue at the Gaia spatial 
resolution complete down to 20 th magnitude does not exist, there 
has essentially been no choice but to implement on-board object 
detection, with the associated advantage that transient sources 
(supernovae, near-Earth asteroids, etc.) will not escape Gaia’s 
eyes. The downside of on-board object detection is the associ¬ 
ated need for hard- and software, which needs to be fully au¬ 
tonomous and near-perfect for all scientific targets over the mag¬ 
nitude range 6-20 mag (which represents a dynamic range of 
400,000) yet at the same time needs to be robust against real-sky 
complexities like double stars, extended objects (such as exter¬ 
nal galaxies, near-Earth asteroids, or planets like Jupiter), neb¬ 
ulosity, crowding, and Galactic cosmic rays and Solar protons, 
and, in addition, needs to process “full-frame” SM data (in TDI 
mode) in real-time: the continuous spin of the spacecraft causes 
a new TDI line with information to enter the CCD read-out regis¬ 
ter every milli-second. And all that, of course, running on space- 
qualified hardware operated in the hostile environment called 
“space” with severe requirements on and limitations of process¬ 
ing margins, reliability, mass, power, heat dissipation, etc. 


Each CCD row in the focal plane is controlled by a separate 
Video Processing Unit (VPU). A VPU is a combination of hard¬ 
ware (composed of a pre-processing and a PowerPC board) and 
associated software which, based on time strobes delivered by 
the atomic clock, commands and controls the CCDs and asso¬ 
ciated electronics, extracts and processes the science data, and 
delivers star packets with science data to the on-board storage 
area, from where the data is (later) transmitted to ground. The 
VPU software responsible for the science-data acquisition and 
processing is called the Video Processing Algorithms (VPAs). 
The VPA prototypes have been developed by Gaia’s industrial 
prime contractor Airbus Defence & Space in Toulouse, France, 
and implemented by Airbus Defence & Space Ltd in Stevenage, 
United Kingdom, under ESA contract. 

Among the many functional responsibilities of the VPAs 
(e.g., supporting attitude-control-loop convergence and mainte¬ 
nance, metrology functions, etc.), the object detection in the SM 
CCDs is of key importance to the success of the Gaia mission. 
A critical task of the detection stage is to discriminate stars from 
prompt-particle events, like Galactic cosmic rays and Solar pro¬ 
tons, which provide a continuous background of spurious events 
on the CCDs. These events need to be filtered out as much as 
possible at the detection stage since they could otherwise un¬ 
necessarily consume telemetry bandwidth and could even pre¬ 
vent stars from being observed. The problem essentially boils 
down to a trade-off between catalogue completeness and false- 
detection rates, and this trade-off is at the core of this work. The 
detection algorithms, described in detail in Section [2| contain a 
large number of configurable parameters. In this paper, we focus 
on 20 of the most important parameters and describe a method to 
optimise these in Section [4] based on simulated data sets of sin¬ 
gle stars, double stars, Galactic cosmic rays, Solar protons, un¬ 
resolved external galaxies, and asteroids which are described in 
Section [3] Our results are presented in Section [5] and further dis¬ 
cussed in Section [6] Scientific implications and conclusions of 
our work can be found in Sections[7]and[8] respectively. Readers 
primarily interested in the main results of this work are advised 
to read Sections [2j[5j[7j and[8] 


2. Video Processing Algorithms (VPAs) 


The Video Processing Algorithms (VPAs; e.g., Provost et al. 
|[2007[ ) are responsible for the science-data acquisition and pro¬ 
cessing, including object detection in the SM CCDs. Object de¬ 
tection has two branches: one for saturated and one for non- 
saturated objects. For Gaia, saturation of stellar images in the 
SM CCDs sets in for objects brighter than G ~ 12 ma^] The 
saturated-object-detection branch, based on “extremity match¬ 
ing” in Airbus Defence & Space terminology, has limited free¬ 
dom for user configuration and is outside the scope of this work. 
The non-saturated-object-detection branch, on the other hand, 
has a significant number of user-configurable parameters leaving 
ample room for scientific optimisation. As a result of real-time 
constraints in high-density fields which cannot be met with a 
software implementation, this branch is primarily implemented 
in hardware - through field-programmable gate arrays - and the 
processing can roughly be decomp osed into two modules: pre¬ 
processing of raw SM data (Section [O]), fo llowed by the actual 
non-saturated-object detection (Section |2T2|). 


1 Gaia’s G magnitude refers to the unfiltered, white-light response 

of the astrometric CCDs combined with the telescope. Its relation to 


canonical filter systems is addressed in Jordi et al. ( |2010| ). 
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Fig. 1. Each SM sample under scrutiny, itself composed of 2 x 2 
pixels, has a so-called working window, centred on it, associated 
with it. Object detection uses the 5 x 5-samples working window 
(not shown) for background subtraction and the 3 x 3-samples 
working window (depicted here for sample i,j = 1,1) for shape 
assessment of detections. The three-dimensional summed-flux 
/ shape vectors h and v contain, respectively, the along-scan- 
integrated (AL) and across-scan-integrated (AC) sum of the 
working-window background-subtracted flux values F,-y, in LSB 
units (Equation [TJ. The total, background-subtracted flux F in 
the working window equals F = vq + Vi + V 2 = ho + h\ + h^- 


2.1. Pre-processing of raw SM data 

Raw SM samples, composed of 2 x 2 hardware-binned pix¬ 
els, are continuously read and temporarily stored in a mov¬ 
ing buffer inside the VPU covering several hundred TDI lines. 
The pre-processing step identifies, through a user-defined mask, 
dead columns and interpolates SM flux values in such cases 
from neighbouring samples. The pre-processing also checks the 
raw SM data for saturated samples, allowing the VPAs to en¬ 
ter either the saturated-object-detection or the non-saturated- 
object-detection branch. Finally, any sample which is not sat¬ 
urated has a linear flux correction performed on it to ac¬ 
count for dark-signal non-uniformity and column-response non¬ 
uniformity (pixel-response non-uniformity integrated over a 
CCD column). Effectively, the next step in the process, detection 
of non-saturated objects, only applies to non-saturated samples 
which have not been dead-column corrected. 

2.2. Detection of non-saturated objects 

The detection part of the algorithms essentially searches for lo¬ 
cal maxima of flux, then analyses the shape of these local max¬ 
ima, subsequently interprets from this shape what type of ob¬ 
ject it is - faint star, prompt-particle event (PPE), or ripple - 
and finally applies a flux thresholding on the local maxima (see 
also Section [2~4]) . This logic may seem simple and sub-optimal - 
compared to more sophisticated, commonly-used packages such 
as SExtractor (Bertin & Arnoutsj [l996| ) - but this is an unavoid¬ 
able result of the (forced) choice of a hardware implementation. 

To detect and analyse local maxima, the VPAs sequentially 
process all samples in the moving VPU buffer containing the 
pre-processed SM samples (“continuous, full-frame SM data 
stream”). Each sample under scrutiny has a so-called working 
window, a square, finite grid of SM samples centred on the sam¬ 
ple of interest, associated with it (Figure [T]). 

The first step in the processing of each sample of interest 
is background determination. The sky background is estimated 
by default as the 5 th -lowest flux value from the 16 samples 
composing the outer ring of the 5 x 5-samples working win¬ 


dow. This background flux value is subtracted from the sample 
to give a background-corrected flux. On-board Gaia, fluxes are 
recorded on a 16-bit analogue-to-digital scale, referred to as LSB 
(Least Significant Bit) units; the nominal conversion gain equals 
0.2566 LSB per electron. 

The second part of the detection uses a smaller, 3x3- 
samples, working window (Figure [T}. The VPAs check for a 
local maximum of flux in this window, centred in our nota¬ 
tion on (/, j) = (1,1), by first calculating two three-dimensional 
summed-flux / shape vectors h and v (for horizontal and vertical, 
respectively): 

2 

hj = Yj p ij tor./= 0, 1,2; 
i =0 

(1) 

2 

Vj = y Fjj for / = 0,1,2, 

7=0 

where F t j denotes the background-subtracted flux of sample 
(/, j) in LSBs; the TDI-coordinate associated with index i is of¬ 
ten referred to as along-scan direction (—>), whereas the CCD- 
column coordinate associated with index j is often referred to 
as across-scan direction (f). The total, background-subtracted 
flux F in the 3 x 3-samples working window is calculated as 
F = Vo + V\ + V 2 (= ho + h\ + ii 2 ). A local maximum is defined as: 

Vi > Vo A Vi > V 2 ; 

A (2) 

h\ > ho A h\> ti 2 , 

where A denotes the ’’logical AND” operator. The vectors v and 
h describe the overall shape of the local maximum in the along- 
and across-scan directions, respectively: if h\ is much larger than 
ho and then the detection has a narrow peak in intensity in the 
across-scan direction, whereas if h\ is approximately equal to ho 
and /* 2 , then the object’s Point-Spread Function (PSF) is rather 
flat (broad) in the across-scan direction. Similar arguments hold 
for v and the along-scan direction. The shape vectors h and v are 
hence used on board to distinguish between three different object 
types. Since the implementation in the VPA detection hardware 
is primarily based on signed 64-bit integer operations, we need 
to define the operators: 

( 0 if v < 0; 

[x] n = \ x if 0<x<2 n -l; (3) 

{2 n -l if 2 n - l < x, 

denoting saturation of v to n bits, and 

(4 = x/2 n , (4) 

denoting truncation of x to n bits (truncation refers to elimina¬ 
tion of the n least significant bits, which is equivalent to integer 
division by 2 n ). In general, the truncation and saturation oper¬ 
ators are used on board to control under- and overflow situa¬ 
tions and to allow casting variables into several integer types, 
for instance unsigned 32-bit integers and signed 64-bit inte¬ 
gers. The actual shape discrimination applied on board is user- 
configurable through 2x5 = 10 so-called rejection parameters, 
denoted (< 2 , b , c, d , ^)hf and (« a , b , c, J, e)uj, which are signed in¬ 
tegers in the range [-32768, +32767]. Objects which satisfy: 

[(([^0 + <2HF]l8 • [/l2 + &HF]l8)4 ’ c Hf)s]32 

< [([(^>2 + ^Hf]i 8 + <?Hf)4] 32 (5) 
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are labeled as (sharply-peaked, i.e., with a high spatial fre¬ 
quency, or HF) “prompt-particle event” in the across-scan di¬ 
rection, while objects which satisfy: 

[(([ho + < 2 LF]l 8 • [/l 2 + &Lf]i8)4 ‘ Clf)s]32 

> [([(^)2 + ^LF]i8 + ^Lf)4-] 3 2 (6) 

are labeled as (broadly-peaked, i.e., with a low spatial frequency, 
or LF) “ripple” in the across-scan direction (roughly reminiscent 
of a higher-order diffraction maximum in a PSF). Objects which 
violate both conditions, which means with a PSF which is nei¬ 
ther too peaked nor too broad in the across-scan direction, are 
labeled as “faint star” in the across-scan direction, where “faint” 
refers to non-saturated. 

In a plot of ho/F versus h 2 /F (Figure [2j also referred to 
as rejection plot), the above inequalities define two hyperbolic 
curves for a fixed value of flux F. The 10 rejection parameters 
determine the shape and position of these hyperbolic curves for 
a fixed value of F; more generally, when considering the three- 
dimensional space of ho/F versus h 2 /F versus F, the above in¬ 
equalities define two hyperbolic surfaces. 

The above discussion, and in particular Equations 0 and 
([6]), is focused on the horizontal shape vector h applicable to the 
across-scan direction. There are similar criteria to Equations 0 - 
^ for prompt-particle-event and ripple definitions in the along- 
scan direction based on the vertical v vector. A genuine faint- 
star detection then requires a faint-star classification along 
scan (based on v and (a,b,c,d,e) hf,-> and (a,b,c,d,e) lf,->) 
and a faint-star classification across scan (based on h and 
(« a , b , c, d, £)hf,t and (< a , b , c, d, ^)lf,t)- 

The last step in the object detection is a flux-thresholding 
stage. This step essentially defines Gaia’s faint limit (nomi¬ 
nally G = 20 mag). Since the thresholding works on on-board 
(background-subtracted) fluxes collected in the SM CCD, its 
functional default value is a (non-intuitive) 110 LSB. 

All in all, there are 2 (—>, f) x 2 (HF, LF) x 5 (< a , b , c, d, e) = 
20 free parameters which govern the classification of local 
maxima into faint stars, ripples, prompt-particle events. The 
functional-baseline values for these rejection parameters are not 
the outcome of a detailed scientific optimisation but are based on 
limited simulations and laboratory data and essentially ensure 
that “normal, single stars” are detected while extremely sharp, 
elongated, and broad cosmic rays and Solar protons are rejected. 
In reality, however, prompt-particle events, and also stars with 
their various multiplicity configurations, take a wide variety of 
(PSF) shapes and wanted objects and unwanted objects are re¬ 
ally mixed populations in (ho/F, hz/F, F)- and (vo /F, V 2 /F, F)- 
space. This study aims to establish scientifically-optimum sepa¬ 
ration surfaces in these spaces. 


2.3. Our VPA emulation 


We have emulated the VPA object detection of non-saturated ob¬ 
jects described in Section |Z2| in a standalone piece of software. 
It covers background subtraction, application of the rejection 
equations ([5}j6]) (both along and across scan), and flux threshold¬ 
ing, but, since it is irrelevant in the scope of this inve stigation, 
not the pre-processing stage described in Section 2T We have 
successfully tested our emulation against the Airbus Defence & 
Space VPA prototype which has been integrated into the Gaia 
Instrument and Basic I mage Simulator (GIBIS; Babusi aux 2005 1 
Babusiaux et al. 2011| and against a stand-alone version of this 
prototype running, in a controlled environment with validation 
test cases, in Gaia’s science operations centre in Spain. 


2.4. From detection to catalogue completeness 

Although the derivation of Gaia’s selection function and cata¬ 
logue completeness is outside the scope of this paper, we pro¬ 
vide a short summary of the observation process of objects with 
the aim to warn the reader that detection and observation prob¬ 
ability are distinct quantities. Schematically speaking, an object 
(transit) has to survive all of the following steps to contribute to 
the final Gaia catalogue (see Figure [3]): 


1 . 


SM detection: 
Section 


2.2 


the three-step process described in 
consisting of (i) the search for local max¬ 
ima of flux, (ii) the assessment of the shape of these local 
maxima allowing object classification through application 
of the rejection equations, and (iii) application of a flux 
threshold. An object that survives these three steps is 
denoted as “detected”; 

2. Pre-selection: every TDI line, all detections are first merged 
with the user-defined “virtual objects” required for calibra¬ 
tion and then sorted in priority (flux). This list is then subject 
to an object-flow-limitation condition allowing only the five 
highest-priority objects to pass to the next step. The associ¬ 
ated limiting density is ~3 million objects per square degree; 

3. Resource allocation: after merging the lists of pre-selected 
objects from both telescopes (SMI and SM2), a final selec¬ 
tion of objects to be followed throughout the Astrometric 
Field (AF) is made. The AF CCDs are not read out full frame 
but only small areas (“windows”) around objects of interest 
are read out. The window size is 12 pixels in the across-scan 
direction and varies from 18 pixels in the along-scan direc¬ 
tion for G < 16 mag to 12 pixels for G > 16 mag. For stars 
fainter than G = 13 mag, the 12 pixels in the across-scan 
direction are normally binned into one sample during read¬ 
out leading to effectively one-dimensional data. At each TDI 
line, the VPAs can simultaneously handle W = 20 samples 
(“resources” in Airbus Defence & Space terminology) in 
the read-out register. Depending on the particular, instanta¬ 
neous configuration of detected-object magnitudes, this cor¬ 
responds to a limiting object of at most ~1 million objects 
per square degree. The VPA uses a prioritised allocation of 
resources to bright detections, meaning that, when there is a 
shortage of windows, faint stars will be sacrificed to allow 
assigning a window to a bright(er), i.e., high(er)-priority, ob¬ 
ject. In short, in dense areas, not all detected objects will 
receive a resource (window); 

4. AF1 confirmation: the VPAs implement, following the de¬ 
tection stage in the SM CCDs, a confirmation stage in the 
first AF strip (AF1). This stage has two purposes, namely 
(i) to confirm, by re-detection of the object using the AF1 
samples, the presence of the object detected in SM, and (ii) 
to estimate the velocities of a subset of the stars to produce 
measurements for the closed-loop spacecraft attitude and 
control sub-system. The confirmation essentially involves a 
pre-processing of raw AF1 samples similar to the SM pre¬ 
processing, then constructs a working window around the 
expected position of the object obtained from forward prop¬ 
agation from the SM detection, then performs background 
estimation similar to the SM process, and finally runs a local- 
maximum detection similar to the SM concept. If a local 
maximum is found and if the background-subtracted AF1 
flux is consistent with the background-subtracted SM flux, 
where “consistent” is defined through user-configurable cri¬ 
teria, then the object is confirmed and considered for fur¬ 
ther observation throughout the focal plane. The confirma¬ 
tion criterion is hence purely flux-based: the “PSF shape” of 
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Fig. 2. Left p anel: example across-scan (AC) rejection plot, based on the along-scan-integrated flux vector h, for 50,000 single 
stars (Section with magnitudes between G = 19.5 and 20 mag (so that typical flux values are F ~ 140 LSB). Stars with a 
symmetric PSF which are centred in an SM sample fall on the diagonal 1:1 relation. Stars with sharp PSFs fall close to the origin 
whereas stars with broad PSFs move diagonally up towards the vertex ho/F = /^/F = 1/3. When, for a given PSF size, the PSF 
centring inside the sample is varied, objects move on a hyperbolic curve either towards the top left or towards the bottom right. In 
the absence of Poisson noise, stars with different brightnesses occupy the same hyperbolic curves. The effect of Poisson noise is 
to broaden this curve into a hyperbolically-shaped cloud; the spread is larger for faint stars since Poisson noise is relatively more 
important for faint than for bright stars. This effect, combined with background-subtraction errors, can lead to negative ho and/or 
h 2 values and hence negative data points, in particular for faint stars. Due to the finite number of LSB units in the working window 
(F ~ 140 LSB), discretisation effects in ho/F and h 2 /F can be seen in the data. Right panel: example across-scan rejection plot with 
high- and low-frequency curves associated with, respectively, Equations ([5]) and ^ for fluxes F associated with magnitudes G = 12 
(red), 18 (magenta), 19 (green), and 20 (blue) mag. The saturation and truncation operators from, respectively, Equations 0 and 
0 have not been included in the curves; they therefore merely serve illustration purposes. The curves, defined through the user- 
defined VPA parameters ( a , b , c, d , £)hf,t and ( a , b , c, d , ^)lf,t which have here - for illustration - been set to the functional-baseline 
values, are flux dependent although the effect of flux on the curves is minimal for bright stars (the curves essentially superimpose 
for stars brighter than G ~ 15 mag). The upper set of curves is referred to as low frequency (LF) whereas the lower set of curves 
is referred to as high frequency (HF). Objects above the upper curve are labeled “ripple” while objects below the lower curve are 
labeled “prompt-particle event”; objects in between the lower and upper curves - for the applicable flux level - are labeled “faint 
star”. Gaia’s on-board object detection is based on an along-scan rejection plot using shape vector v (not shown) and an across-scan 
rejection plot using shape vector h (shown here). The domain of possible ho/F and fe/F values is limited by the definition of a 
local maximum in the VPAs: since a local maximum is defined as h\ > ho and h\ > h 2 (Equation^, the maximum values that ho/F 
and h 2 /F can (asymptotically) take are 1/3 each. Similarly, the maximum value that each of them can (asymptotically) take is 1/2, 
with the other then (asymptotically) taking the value 0. More generally, Equation ^ induces boundaries on the rejection plot (solid 
lines), below which a data point must fall to obey the VPA local-maximum definition. 


the confirmed object is not tested. Clearly, since the confir¬ 
mation stage is not 100% perfect, there is a risk of a detected 
object to be adversely killed by the confirmation step; 

5. AF2-9 acquisition: the acquisition of the bulk astrometric 
window data in CCD strips AF2-AF9 is not guaranteed to 
be successful. The scanning-law-induced across-scan motion 
of objects, for instance, may cause them to drift out of the 
CCD in the across-scan direction. There is also a finite prob¬ 
ability that the window of a star is polluted, for instance by 
straylight caused by very bright stars or planets or by an in¬ 
jected line of charge used for radiation-damage mitigation. 
Similarly, windows can be affected, for instance, by a re¬ 
duced CCD integration time (activated TDI gate) induced by 
a simultaneously-transiting bright star or by a dead column; 

6. On-board storage and deletion: after the focal-plane transit, 
the window data are collected into star packets which are 
temporarily stored into the on-board solid-state mass mem¬ 


ory before being transmitted to ground. The downlink to 
ground uses a prioritised scheme. Since the mass memory 
has a finite size, it occasionally fills up necessitating acti¬ 
vation of an on-board deletion scheme. This scheme is also 
prioritised. So, even if a detected star manages to get all its 
window data properly collected into a star packet, there is a 
finite probability that the data gets deleted on board; 

7. Ground reception: finally, even when a star packet is trans¬ 
mitted to ground there is a small but finite probability that 
it is lost as a result of unplanned ground-station outages or 
unrecoverable transmission(-frame) anomalies. 


This summary clearly demonstrates that near-perfect object de¬ 
tection, being the first element in the chain, is a pre-requisite but 
not a guarantee for a high observation probability. 
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Fig. 3. Schematic summary of steps involved in the observation 
process, i.e., detection, selection, confirmation, acquisition, and 
survival of objects. Steps 1 and 2 have been implemented in 
hardware. 


of the Gaussian (weight 55%) and the Lorentzian (weight 45%), 
which is often used as approximation to a Voigt function, i.e., the 
convolution of a Lorentzian with a Gaussian. Such a sum, after 
parameter tuning, actually provides a remarkabl^good approxi¬ 
mation to the individual LSFs. Since the SM LSFs do show small 
asymmetries, a more suitable, empirically-motivated, parametri- 
sation of the LSF in SM is a summation of a Gaussian and 
a Lorentzian LSF including LSF asymmetry (e.g., 

[Brauns| |2008| ): 


Stancik & 


LSF(v) = / • L(v) + (1 - /) • G(v), 


(7) 


where: 


L(v) = 


[2A]/[tt y(v)] 

1 + 4[(v - v 0 )/y(v )] 2 


is a Lorentzian, 


( 8 ) 


A /41n2 

G<v)= ^jV— exp 


-41n2 


V-Vo 

y(v) 


is a Gaussian, (9) 


3. Simulated data sets 

In order to investigate the performance of the on-board detec¬ 
tion algorithms on various object categories, we need represen¬ 
tative image libraries of various types of objects. As explained in 
Section [2] they should cover the non-saturated-object regime in 
the SM CCDs. Since saturation in SM starts at G ~ 12 mag, we 
decided to use the range G = 12.5-20 mag, keeping 0.5 mag as 
margin. We should stress at this stage that the precise bright-star 
limit adopted in this study is not an important parameter: the 
flux dependence of the rejection equations W at the bright 
end (G ^ 15 mag) is very weak (see also the curves in Figure [2]) 
which means that if we optimise the detection including stars at 
G = 12.5 mag, this solution also applies to any brighter stars 
(provided they do not saturate). 


3.1. Single stars 


For single stars, we need a library of two-dimensional images 
covering the magnitude range G = 12.5-20 mag and, in view 
of the VPA background subtraction, covering at least 7x7 SM 
samples (i.e., 14 x 14 CCD pixels). 

Gaia’s optical design allows near-diffraction-limited imag¬ 
ing: the system wave-front error in the astrometric field equals 
~50 nm RMS so the Strehl ratio exceeds 80% for A > 665 nm 
(i.e., unreddened mid-K and later spectral types), applicable to 
the majority of Gaia targets. Gaia’s PSF is hence symmetric to 
first order and PSF asymmetries, caused by optical aberrations, 
are modest and mainly visible in the (far) wings of the PSF. In 
the SM fields of view, at the edges of the telescope’s fields of 
view, the average wave-front error is ~63 nm RMS, which means 
that diffraction-limited imaging is only achieved for the reddest 
objects (A > 838 nm, i.e., reddened M-type stars). Figure [4] 
shows 448 predicted SM along-scan LSFs. They have been 
obtained through full-fledged, realistic, time-consuming simu¬ 
lations combining 14 SM wavefront-error maps (delivered by 
Airbus Defence & Space) with 16 stellar spectral-energy distri¬ 
butions from Pickles s library ( Pickles 1998 spectral types BIV, 


AOV, A3V, A5V, F2V, F 6 V, F 8 V, G2V, K3V, MOV, M 6 V, G 8 III, 
K3III, MOIII, M7III, and BOI) with two values of interstellar ex¬ 
tinction (unreddened and A 550 n m = 5 mag). Overplotted, for ref¬ 
erence, are a Gaussian (red) and a Lorentzian (green); both have 
the same FWHM, corresponding to cr = 1.0 AL pixel for the 
Gaussian. Also overplotted for reference (in cyan) is the sum 


and 


7(v) = 


_2yo_ 

1 + exp [a • (v - vq)] 


is a sigmoid, 


( 10 ) 


where v is the along-scan pixel coordinate, 0 < f < 1 is 
the fraction of the Lorentzian character contributing to the LSF 
(/ = 0 = Gaussian and / = 1 = Lorentzian), A is the area (in¬ 
tensity) of the LSF, and Vo is the mean (centre) position of the 
LSF. The parameter a describes LSF asymmetry: negative val¬ 
ues skew the LSF towards higher values of v, while positive val¬ 
ues skew the LSF towards lower values of v. When a = 0, y(v) in 
Equation ( fT()| ) reduces to yo and the LSF is a standard, symmet¬ 
ric Gaussian or Lorentzian with a constant width. The parameter 
yo denotes the FWHM of the Gaussian or Lorentzian for a = 0 
(for the Gaussian, we have yo = 2 V 21 n 2 cr when a = 0 ). The 
particular sigmoidal functional form of y(v) in Equation ff0| ) is 
advantageous since the width asymptotically approaches upper 
and lower bounds. 

The LSF model in Equation ([7]) applies not only well to the 
along-scan direction but also to the across-scan direction. One 
peculiar aspect relevant (only) in the across-scan LSF is the fact 


2 For spectral LSFs, a Voigt profile can be physically understood 
realising that the Gaussian refers to Doppler broadening while the 
Lorentzian refers to radiation damping and collisional (pressure) broad¬ 
ening. Voigt profiles also feature frequently in crystallography because 
X-ray diffraction profiles are well represented by (pseudo-)Voi gt pro¬ 
files (e.g., 


van de Hulst & Reesinck |1947[ |Wertheim et al.[ |1974[ 
.| |1982| > since particle-size broadening 


[Langford] 119781 |de Keijser et al.| 


corresponds to a Lorentzian and instrumental contributions and lattice- 
strain broadening can be represented by a Gaussian. It is therefore not 
surprising that also for optical LSFs, where the physical expectation 
is a convolved Fraunhofer-diffraction profile, Voigt profiles provide a 
convenient representation. Gaia has a rectangular aperture and an asso¬ 
ciated monochromatic Fraunhofer diffraction pattern described by the 
square of a sine function: I A oc sin 2 (a)/a 2 , with a = [nDv]/[FA], 
with D the aperture dimension (1.45 m along and 0.50 m across scan), 
F = 35 m the focal length, and v the spatial coordinate in the focal plane 
/ on the CCD. This profile, after spectral superposition, is convolved 
with Gaussian and boxcar functions representing various smearing con¬ 
tributors caused by spacecraft attitude jitter during the CCD integration, 
the scanning-law induced differences between the optical and electronic 
speed of the image, the detector modulation transfer function which in¬ 
cludes charge diffusion of electrons inside the CCD, optical distortions, 
the electrodes / phases corresponding to the TDI integration stages in a 
pixel, and pixel binning. 
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Fig. 5. Histograms of /, a , vo, and cr (with yo = 2 V21n2 cr) obtained when fitting the along-scan (top) or across-scan (bottom) LSF 
model (Equation]^]) to the 448 LSFs resulting from combining 14 SM CCDs, 16 stellar spectral-energy distributions, and two values 
of interstellar extinction (see Figure]?]). The normalisation constant A has been frozen to unity in all fits. The across-scan direction 
refers to average across-scan motion. Since the Gaia pixel ratio (10 x 30 jum 2 , i.e., 1 : 3) effectively cancels the Gaia aperture ratio 
(1.45 x 0.50 m 2 , i.e., 3 : 1), the along- and across-scan LSFs have roughly the same size (cr ~ 1 pixel). 


that it varies in size (and shape) with time: stars, during their 
transit of the focal plane, have a small yet finite across-scan mo¬ 
tion caused by the precession of the spin axis associated with 
the scanning law of the sky. The transverse speed of objects in 
the focal plane hence varies sinusoidally with a period equal 
to the satellite spin period (6 hours) and with an amplitude of 
173 mas s -1 (milli-arcsec s -1 ), corresponding to 2.80 AC pixels 
over the 2900 integrating TDI lines in the SM CCDs. 

Since we need to simulate and process hundreds of thou¬ 
sands of two-dimensional PSFs with random centre positions 
and noise configurations quickly, a parametrisation of the SM 
along- and across-scan LSFs using two sets of five parameters 
(/, A, a, vo, and 70 ; Figure [5} provides a convenient trade-off 
between realism and speed of our simulations. We thus simulate 
750,000 single-star images as follows: 

1. parametrise the along-scan LSFs from the 448-item full- 
fledged-simulation library by fitting, for each LSF, four 
free parameters (/, a, vo, and 70 ) to the LSF model from 
Equation 0 ; we freeze A to unity in all fits to guarantee flux 
normalisation; 

2. do the same but then across scan. We use three full-fledged 
PSF libraries with 448 LSFs, (1) without across-scan motion 
(0 mas s -1 ), (2) with the average across-scan motion (173 • 
2 /71 =110 mas s -1 ), and (3) with the maximum across-scan 
motion (173 mas s -1 ); 

3. then repeat the following steps 750,000 times: 

4. select a random SM CCD, a random spectral type, and a 
random value of the interstellar extinction; in addition, se¬ 
lect a random value of the across-scan motion with weights 
\ ■ [sin - 1 (2/tt)]/[tt/ 2] = 0.2197 for set (1), 0.5 for set (2), 
and \ ■ [(tt/ 2) - sin -1 ( 2 / 7 r)]/[ 7 r/ 2 ] = 0.2803 for set (3); 


5. get the five along-scan LSF fit parameters /, A = 1, or, vo, 
and 70 ; 

6 . do the same but then across scan; 

7. make a two-dimensional PSF, simply by multiplying the 
along-scan LSF with the across-scan LSF; 

8 . select a random sub-pixel position of the centre of the star, 
in two dimensions (along and across scan); 

9. select a random magnitude between G = 12.5 and 20 mag. 
In practice, we draw 100,000 stars between G = 12.5 and 
13.5 mag, 100,000 stars between G = 13.5 and 14.5 mag, 
..., and 50,000 stars between G = 19.5 and 20 mag. The 
total number of objects is hence 750,000 exactly; 

10 . add sky background, corresponding to a typical surface 
brightness of V = 22.5 mag arcsec -2 (this corresponds to 
a background level of 0.63 electrons per pixel after 2.85 sec¬ 
onds of integration on the SM CCD); 

11. add random Poisson noise, both on the object and on the sky- 
background counts; 

12. project (“bin”) the PSF image on the SM samples (composed 
of 2 x 2 CCD pixels); 

13. add a random total detection noise on each sample 
(10.9 electrons RMS per sample for the SM CCDs, based 
on ground-based payload-performance testing); 

14. convert the electron counts to LSB units. 

We can ignore saturation, both at CCD-pixel-full-well and at 
CCD-charge-handling-capacity level, since our simulated stars, 
by construction, do not saturate (recall that saturared samples 
follow a different branch of the on-board detection software, 
“extremity matching” in Airbus Defence & Space terminology). 
In the above process, to avoid border effects, we do not limit 
ourselves to 7 x 7 SM samples: each simulated image covers 
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Along-scan pixels 


Fig. 4. Star-Mapper (SM) along-scan (AL) line-spread functions 
for all 448 combinations of 14 CCDs, 16 stellar spectral-energy 
distributions, and two values of interstellar extinction. Since the 
LSF size is primarily determined by the (local) optical quality of 
the telescope, which varies over the SM field of view (the wave- 
front error varies between 40 and 105 nm), the curves cluster 
in various “families” (see also Figure [5]). The curves do not in¬ 
clude the effect of the on-chip binning of the SM pixels in 2 x 2 
samples. Overplotted, for reference, are a Lorentzian in green 
(Equation [5]), a Gaussian in red (Equation [9]), and a sum of a 
Gaussian with weight 55% and a Lorentzian with weight 45% in 
cyan (Equation [7]). 


40 x 40 samples (80 x 80 pixels), which is then fed to the detec¬ 
tion algorithm for object finding. 

This recipe, clearly, does not provide a single-star library 
which is compatible with the astrophysical distribution of spec¬ 
tral types in the Gaia sky (see, e.g., |Robin etak] |2012| for a 
review of the expected spectral-type statistics and properties of 
the Gaia catalogue). But such a library is also not needed for our 
purposes: we aim to optimise the detection of all possible (CCD, 
spectral type, extinction) configurations, regardless of their exis¬ 
tential probability, since we do not want Gaia’s on-board detec¬ 
tion to induce any biases in the selection of stars and hence in 
the final catalogue. 


AG = 0-5 mag (with the added constraint that the secondary is 
brighter than G = 21 mag), a random orientation in the range 
a = 0°-360°, a random separation in the range p = 0-354 mas, 
and a random sub-pixel centring. The maximum separation has 
been chosen to correspond to half of the (faint-star) along-scan 
window size in the astrometric field (i.e., 6 AL pixels) since ob¬ 
jects separated by larger angles will each receive their own win¬ 
dow and can hence be considered as single stars. 

As for single stars, we ignore saturation and avoid border 
effects by simulating oversized images covering 80x80 samples, 
which are then fed to the detection algorithm for object finding. 
In general, one double star simulated as described above can lead 
to either 1, 2, 3, or 4 local maxima: 

- one local maximum typically results for double stars with 
small separations; 

- two local maxima typically result in cases of intermediate to 
large separations, allowing both components to be detected 
individually; 

- three and four local maxima can result if both components 
generate their own local maximum and if at least the primary 
component is bright and the separation is preferably not too 
large: the intersection(s) of the along-scan diffraction wing 
of one star with the across-scan diffraction wing of the other 
star (and/or vice versa) can yield a third (and/or fourth) local 
maximum. 


We discriminate between double stars which generate one local 
maximum (symbolically ** —> *) and double stars which gen¬ 
erate two local maxima (** —> **). We construct two double¬ 
star data sets by simulating double stars in an open loop and as¬ 
signing them either to the one-local-maximum or the two-local- 
maxima data set (or ignoring them in case of no local maxi¬ 
mum) and repeating this exercise until both data sets have ex¬ 
actly 750,000 entries (the ** —> ** data set has thus 375,000 
underlying double stars whereas the ** —> * data set has 750,000 
underlying double stars). 

Again, as for single stars, this recipe, clearly, does not pro¬ 
vide a double-star library which is compatible with the (pair¬ 
ing) probability of physical binaries in the Gaia sky (see, e.g., 
Arenou, 2011 [ for a review of the expected binary-star statistics 


and properties of the Gaia double- and multi ple-s tar catalogue). 
But, for the same reasons as set out in Section[3J]for single stars, 
this is also not required or desired for our purposes. 


3.2. Double stars 

For double star^] our requirements do not differ from those for 
single stars. We therefore follow the same recipe, except that 
we randomly select two objects (two PSFs) in each step (i.e., 
for each image). In practice, we simulate the primary compo¬ 
nent along the lines set out in Section |3.1| The primary com¬ 
ponent is, by definition, the brightest and falls in the range 
G = 12.5 to 21 mag; we go one magnitude fainter than for sin¬ 
gle stars since an unresolved, equal-brightness double star will 
be 0.75 mag brighter than each component separately. Each sim¬ 
ulated secondary component shares the CCD, the across-scan 
motion, and the interstellar extinction with its primary compan¬ 
ion but has a ra ndom spectral type chosen among the 16 types 
listed in Section [3T] a random magnitude difference in the range 

3 From now on, we will exclusively use the word “double star” to 
denote both optical double and (physical) binary stars; we do not treat 
higher-order multiple stars. In particular in dense areas, a significant 
fraction of Gaia double stars will not be binaries but optical doubles. 


3.3. Ghosts 

When feeding the single-star images described in Section |3.1| 
to the detection algorithm, it is not rare to retrieve multiple lo¬ 
cal maxima. Figure [6] shows an example of a single star which 
has five associated local maxima, one of the star core itself and 
four spurious ones in the (far) wings, from now on referred to as 
ghosts. This can happen since the (rather flat) PSF wing, some 
distance from the star centre, either along or across scan, can 
cause a local configuration of flux values in the 3 x 3-samples 
working window which satisfy the VPA local-maximum cri¬ 
teria on the PSF shape. Generally, such ghosts are found at 
some distance from the PSF core, where the PSF flattens out 
and where flux levels are low. They are hence typically faint. 
In our sample of 750,000 single stars (Section [3TT] ), we found 
326,596 ghosts. Figure [7] shows their properties. The majority 
of the ghosts (73%) is associated with the 100,000 bright stars 
in the bin G e [12.5,13.5] mag. The faintest star which has a 
ghost brighter than the VPA flux threshold at G = 20 mag is a 
G = 16.3-mag star. The ghosts vary in brightness from G « 19 to 
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Fig. 7. Properties of all 326,596 ghosts brighter than G = 20 mag originating from single stars in the range G = 12.5-20 mag. Panel 
1\ histogram of the G magnitude of the “parent stars” responsible for the ghosts. The faintest parent star has G = 16.3 mag. Panel 
2: histogram of the G magnitude of the ghosts. Panel 3: average number of ghosts that a star of magnitude G generates. See also 
Figure [TT] 



Fig. 6. Example of the Sky-Mapper (SM) PSF of a single, bright 
star (G ~ 13 mag) which has five associated local maxima, one 
of the star itself and four spurious ghosts in the wings. The five 
red squares indicate the 3 x 3-samples VPA working windows 
of the five local maxima. The horizontal axis denotes along- 
scan (AL) SM sample while the vertical axis denotes across-scan 
(AC) SM sample. The colour coding is logarithmic and shows 
the sample flux in LSB after on-board background subtraction. 
Single stars can have associated ghosts out to a few dozen SM 
samples from the star centre (recall that our single-star simula¬ 
tions are based on a 40 x 40-samples grid). 


20 mag, with bright ones being (very) rare and faint ones being 
most common. 

Ghosts which pass the thresholding stage are in principle 
harmful since they do compete in the window assignment (re¬ 
source allocation) with real stars (Section [24]). We therefore fol¬ 
low what happens to ghosts when we optimise the rejection pa¬ 
rameters by making a special object category labeled “ghosts”, 
allowing to evaluate the performance of the optimised set of VPA 
parame ters on this set of objects. This is further discussed in 
Section l64l 


t 2 


■ GCR protons - Creme96 (creme.isde.vanderbilt.edu) 

• GCR Helium ions - Creme96 (creme.isde.vanderbilt.edu) 

■ Solar protons - ESP (www.spenvis.oma.be) 
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Fig. 8. Energy distribution at L2 and at Solar maximum (after 
spacecraft shielding) for each considered type of incoming parti¬ 
cle: Galactic-cosmic-ray proton (solid) and Helium nucleus (dot¬ 
ted), and Solar proton (dashed). The energy of each simulated 
prompt-particle event is randomly drawn from the respective dis¬ 
tributions. 


3.4. Galactic cosmic rays and Solar protons 


Gaia’s CCDs are not only sensitive to photons but also to ener¬ 
getic particles (radiation) that can lead to spurious events and, 
ultimately, unwanted detection^] As mentioned earlier, it is thus 
critical to discriminate prompt-particle events from astronomical 
sources at the detection stage. We hence simulate catalogues of 


4 Particles with energies less than ~100 MeV are also responsible 
for displacement damage through generation of point defects (“traps”) 
in the CCD Silicon crystal lattice. These defects can trap and effectively 
delay electrons during their transfer from one pixel to the next, leading 
to an image distortion and decrease in signal-to-noise ratio. Implications 
of this charge-transfer in efficiency for the Gaia on-g r ound data process - 
ing are discussed in, e.g., Prod’homme et al. ( |2012[ ); Holl et al.] ( |2012| >. 


9 




























J.H.J. de Bruijne et al.: Detecting stars, galaxies, and asteroids with Gaia 



GCR Helium ion 



0 5 10152025 


Solar proton 



0 5 10152025 


2000 

1750 

1500 

1250V 

QJ 

i/) 

1000 § 

o 

u 

750 

500 

250 

0 


Fig. 9. Examples of prompt-particle events for incoming parti¬ 
cles of different nature and energy as generated by our simula¬ 
tor. The events are chosen arbitrarily to represent their diversity 
in orientation, size, and brightness. Elongated events, such as the 
one depicted in the bottom centre thumbnail, are less likely to oc¬ 
cur since they need to pass through the CCD at a rather shallow 
angle. The most common events are circular (e.g., middle right 
thumbnail), with the incoming particle passing straight through 
the CCD. 


prompt-particle events representative of the Gaia CCD architec¬ 
ture and the radiation environment of the spacecraft. 

Gaia operates close to Solar maximum at the L2 Lagrangian 
point located 1.5 million km beyond the Earth and its radiation 
belts. The L2 (interplanetary) radiation environment can be con¬ 
sidered to be principally composed of Galactic Cosmic Rays (re¬ 
ferred to as GCRs in Figures 8] and [9]) and Solar particles: 

- Cosmic rays are high-energy particles (up to several GeV, 
cf. Figure [8}, generated mostly by supernovae, that are coin¬ 
cidentally passing through the Solar system. At the energies 
considered in this work, they are composed of approximately 
90% protons, 9% Helium ions, and 1% heavier ions. The in¬ 
coming flux of cosmic rays is rather continuous with a slight 
modulation by the Sun’s activity (minimum at Solar max¬ 
imum) and can be considered as a constant background of 
5 particles cm -2 s -1 . 

- Solar particles - essentially protons - are lower-energy par¬ 
ticles (from several eV to a few hundred MeV, cf. Figure [8]) 
emitted by the Sun during discrete magnetic reconnection 
events occurring at the Solar surface. The Solar-proton flux 
hence varies from close to zero during Solar-quiet times to 
extremely high fluxes, up to millions of protons cm -2 s -1 , 
during Solar flares. 

Generating representative catalogues of prompt-particle 
events requires the energy spectrum for each type of incoming 
particle at L2 during Solar maximum, accounting for spacecraft 
shielding. This can be obtained using standard on-line models 
and tools. We use the CREME96 model |Tylka et ak||l997| for 
cosmic rays and the SPace ENVironment Information System 
(SPENVIS) together with the Emission of Solar Protons (ESP) 
total-fluence model ( Xapso s et al.|[l9 99 ,2000) for Solar protons. 
Spacecraft shielding stops a significant fraction of the lower- 
energy particles (i.e., mostly the Solar protons). To account for 


the impact of shielding on each spectrum, we use the particle- 
transport facility of each tool and an Aluminium thickness value 
of 11 mm, corresponding to the average Al-equivalent shielding 
at the Gaia focal-plane assembly. The resulting spectra for each 
particle type are shown in Figure [8] and are used as input in our 
event simulation. 

Each prompt-particle-event image in our catalogue is gener¬ 
ated using code developed by A. Short (2006, private communi¬ 
cation) in support of GIBIS and validated against in-orbit XMM- 
EPIC MOS CCD data. To generate a single event, the main steps 
of the simulation consist of: 

1. Random generation of the particle energy following the input 
energy spectrum, sub-pixel position, and angle of incidence; 

2. Energy deposition (i.e., generation of free electrons) along 
the particle path through the CCD according to the Silicon 
stopping power applicable to the type of incident particle; 

3. Electron diffusion in the field-free (and depleted) CCD re- 
gion(s); 

4. Mapping of the electrons to the CCD pixels and image gen¬ 
eration. 

Our simulation takes into account the pixel architecture and 
geometry of the Gaia SM CCDs (normal-resistivity Silicon, 
10 x 30 /d m 2 pixels, 9 /im depletion depth, and 7 /im field-free 
thickness) and a nominal operating temperature of 163 K. 

We generate two catalogues, one for cosmic-ray events and 
one for Solar-proton events. Figure [9] shows examples of simu¬ 
lated events for each particle type. One event can lead to multiple 
detections (including no detections): our 2,602, 864 cosmic-ray 
images lead to 3, 884,976 detections (i.e., local maxima in the 
VPA), which means the average multiplication factor is 1.49, 
while our 1,195,992 solar-proton images lead to 1,611,882 de¬ 
tections (i.e., local maxima in the VPA), which means the av¬ 
erage multiplication factor is 1.35; this difference can be un¬ 
derstood since cosmic rays are typically elongated while Solar 
protons are typically more point-like. For both event types, we 
“only” use 750,000 randomly-selected local maxima in the VPA 
in our study (Section]?]). 

The statistical properties of our catalogues agree with the 
properties of similar catalogues which have been developed in¬ 
dependently by Airbus Defence & Space in 2008 in the frame 
of the Gaia project based on | Kirkpatrick] \\919) ; [Lomheim et al. 
( | 1990| ); [Dutton et al7| ( |1997|) . One notable feature of both sets of 
prompt-particle-event catalogues is the lack of faint events: the 
faintest detected event has G ~ 18.7-18.8 mag (~1,800-1,700 
electrons). This is not surprising, given the input energy distri¬ 
butions displayed in Figure [8] In addition, one should realise 
that faint events come either from (very-)high-energy particles, 
which are hardly decelarated when they interact with the Silicon 
and hence deposit only few free electrons, or from low-energy 
particles, which are totally absorbed but which can only free 
a limited number of electrons. In addition, particles ineracting 
with CCDs deposit most energy just before they come to a stop, 
which gives a hard cut-off at low energies. 

3.5. Unresolved galaxies 

Gaia will not only observe stars but will also encounter mil¬ 
lions ofpoorly-to-unresolved galaxies all over the sky ( |de Souza 
et al.| |2014| ). This unique dataset is a valuable by-product of 
the mission, and specific groups in the Gaia Data Processing 
and Analysis Consortium (DPAC) are in charge of developing 
strategies and the necessary software implementation for spec- 
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tral (Tsalmantza et al. 2009| ) and morphological ( Krone-Martins 
et al. |2013| ) studies of these objects. 


As Gaia is primarily a Galactic astrometry mission, we do 
not take galaxies into account for the optimisation of the rejec¬ 
tion parameters (Section^. However, it is important to study the 
impact of this optimisation on the detection of such objects, as 
this may have a direct impact on the scientific outcome of their 
study as well as on the strategies to be adopted for their analysis 
during the data processing. Thus, to assess the detection of un¬ 
resolved galaxies, we create a catalogue of synthetic galaxy pro¬ 
files covering two extreme cases: (i) pure de Vaucouleurs pro¬ 
files, representing pure classical galaxy bulges or elliptical ob¬ 
jects, and (ii) pure exponential profiles, representing pure galaxy 
disks. We have deliberately chosen not to include the most ex¬ 
treme case of galaxy profiles, representing active galactic nu¬ 
clei (AGNs), as their point-source-like profiles will be naturally 
detected by Gaia. The simulations have been performed with 
GIBIS, which simulates the de Vaucouleurs profiles using the 
effective radius Ry, corresponding to: 


Iv(f) oc exp| -7.67 


and the exponential profile using the disk scale length R E : 


I E (r) oc exp - 


Re 


( 11 ) 


( 12 ) 


The simulated profiles are circularly symmetric, as elliptical pro¬ 
files are equivalent to a circular profile of a smaller radius for 
detection purposes. They uniformly cover the parameter space 
with radii between 0.2 and 2.0 arcsec and integrated magnitudes 
from V = 14 to 20 mag, regardless of the physical relevance 
of each parameter combination (e.g., a fraction of this param- 
eter_space is not expected to be occupied by real galaxies; see 
de Souza et al., 2014| ). As generating GIBIS simulations is time 
consuming, the simulations have been performed arranging sev¬ 
eral profiles in the same image. The profiles have been arranged 
on a regular grid around galactic coordinates (/, b) = (40°, 52°). 
These coordinates have been chosen since - due to Gaia’s scan¬ 
ning law used in GIBIS - the satellite will perform 152 obser¬ 
vations with different transit angles around this position, making 
the analysis of the results less prone to statistical fluctuations. 
Considering each transit as an independent observation, a total 
of 179,056 observations have been simulated. Figure [T0| shows 
two examples of the resulting SM images. 


3.6. Asteroids 


Besides stars (Sections |3.1f[3.2| ) and unresolved galaxies 
(Section |3.5| ), Gaia will also observe a few hundred tho usan d 
Solar-system bodies, mainly aster oids (e.g., Hestrof fer et al.| 
|2010[ |Hestroffer & Tanga| |2014| ). A specific data-reduction 
pipeline with customised identification and centroiding algo¬ 
rithms has been implemented in DPAC for these moving, gen¬ 
erally unresolved objects. Like for unresolved galaxies, we do 
not take asteroids into account for the optimisation of the re¬ 
jection parameters (Section [4]) albeit we do assess their detec¬ 
tion performance using GIBIS simulations. Compared to cur¬ 
rent and upcoming ground-based surveys, Gaia’s limiting mag¬ 
nitude is modest. However, Gaia has the unique capability to 
discover new near-Earth objects (NEOs) at low Solar elongation, 
i.e., the faint end of the detected population is of particular in¬ 
terest and important for the science-alerts-driven ground-based 



Fig. 10. Examples of extreme galaxy profiles in SM CCDs sim¬ 
ulated with GIBIS. Exponential disk profiles are shown in the 
top panel, while de Vaucouleurs profiles are shown in the bot¬ 
tom panel. The colour map is logarithmic and encodes the flux 
in each pixel in electrons. The profiles do not appear circularly 
symmetric since the pixels in this representation are square while 
Gaia’s pixels are recta ngular. In our detection-performance as¬ 
sessment (Section |7.2| ), individual images of all objects, at the 
correct angle for each transit, are generated and analysed. 



Fig. 11. Example GIBIS images of main-belt asteroids (left) and 
near-Earth objects (right) in SM CCDs. The colour map is loga¬ 
rithmic and encodes the flux in each pixel in electrons. 


follow-up network Gaia-FUN-SSO ( [Thuillot et al.| |2014| ). We 
hence distinguish two groups, the main-belt asteroids (MBAs) 
and NEOs; the latter are generally fainter and have larger appar¬ 
ent motion. The asteroid velocity vectors are randomly sampled 
from the distributions from |Mignard et al.] ( [2007[ ). Since the mo¬ 
tion of asteroids around the Sun is within some tens of degrees 
from the Laplacean plane, their motion relative to the Gaia focal 
plane is not uniformly distributed: speeds are on average larger 
in the across-scan direction. To produce statistics for the detec¬ 
tion analysis for each type of asteroid, ten independent simu¬ 
lation grids (across-scan speed versus along-scan speed versus 
magnitude between V = 14 and 21 mag) have been created, re¬ 
sulting in 4640 MBAs and 4640 NEOs. The asteroids have been 
shuffled around at random positions in the focal plane between 
the different simulations to average out any possible positional 
dependency. Figure [IT] shows two examples of asteroid images. 


4. Optimising the free parameters 

4.1. Defining the merit function 

In order to optimise the 20 free parameters of the low- and high- 
frequency rejection curves, we need to define a merit function. 
First, it is important to realise that the low- and high-frequency 
curves are independent. The 20-dimensional problem hence re¬ 
duces to two 10-dimensional problems. After some experiment¬ 
ing, we settled - for both the low- and the high-frequency opti¬ 
misation - on the functional form: 

P( u) = E*(u) • P**^*(u) • P„_>**(u) • 
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[I-Pcr(u)] • [1 -P S p(u)], (13) 

where the 10 -dimensional vector u = 
a^b^c^d^e^) is the vector of unknowns (free parameters) of 
either the low- or the high-frequency problem; the subscript —> 
denotes the along-scan parameters whereas the subscript T de¬ 
notes the across-scan parameters. The subscript * stands for a 
single star, ** —> * for a double star inducing a single detec¬ 
tion, ** —> ** for a double star inducing two detections, CR 
for cosmic ray, and SP for Solar proton. The general symbol P 
denotes detection probability, i.e., the fraction of objects which 
fall above the high-frequency curve in the high-frequency case 
or below the low-frequency curve in the low-frequency case. In 
essence, the merit function from Equation ( [13] ) defines a balance 
between single- and double-star detection versus cosmic-ray and 
Solar-proton rejection: the higher P, the better Gaia’s (stellar) 
science return. We do not consider the detection performance 
of external galaxies and/or asteroids in the merit function since 
these objects are not a core science product: Gaia is a Galactic 
astrometry mission and the on-board detection should be opti¬ 
mised for stars. 

The detection probability of single stars, P*(u), is calculated 
as: 

20 

PM) = E W G ■ PgM), (14) 

G=13 

where the summation is over the G-magnitude range of interest, 
wg denotes the weight of each magnitude bin, i.e., the fractional 
number of stars in that bin from the standard Gaia Galaxy model 
(Table [I]), and Pg*(u) denotes the average detection probability 
of the Ng simulated stars in each magnitude bin (recall that Ng = 
100,000 for G = 13,..., 19, while N g =i o = 50,000): 


Pg* = 


! N g 

-Y 

Jrz AJ 


N g 


[(([Vo,/ + 0->] 18 • [V2 ,i + ^]l8)4 
< for low frequency 
> for high frequency 


1 if 


• C- 


[(mh+d- 


J18 


+ e. 


•)j 


4J32 


[(([ho,i + 0r]l8 ' \hlt + ^t]l8) 4 • C T )J 
< for low frequency 
> for high frequency 

[([(G>2 + <*r]l 8 + e l) 4 ] 32 

0 otherwise, 

where F t = voj + - ho,i + h\,i + ^ 2 ,i is the (background- 

subtracted) LSB flux of star i in the 3 x 3-samples working win¬ 
dow, and Vjj and hjj denote the LSB flux sums of the / h vertical 
(across-scan) and horizontal (along-scan) vectors of the 3x3- 
samples working window of star i (see Section [2] Equation [I]). 
The saturation and truncation operators [.. .]„ and (.. .)„ are de¬ 
fined in Section 1^21 

The detection probabilities of double stars, />**_>*(u) and 
P**_>**(u), are calculated along the same line as the detection 
probability for single stars. The detection probabilities of cos¬ 
mic rays and Solar protons, Pcr(u) and Psp(u), are calculated 
nearly the same, the only difference being that the weights wg 
are all identical to 1 since the probability of occurrence of a par¬ 
ticular event with a certain energy (i.e., magnitude) is al ready 
covered in the creation of the event catalogues (see Section [3~4]). 


Table 1. Statisti cs of the Gaia Universe Model Snapshot GUMS 
(Robin et al., |2012| ). N denotes the number of objects in the 
model in each magnitude bin (not to be confused with Ng which 
denotes the number of simulated objects in magnitude bin G); 
wg denotes the relative, normalised weight of each bin, such that 
Yjg =\3 w g - 1; P*,min and P**,min denote the minimum-required 
detection probabilities for single and double stars, respectively 
(Equation [16]). The square root refers to the fact that P refers to 
either the high- or the low-frequency detection probability; the 
final detection probability is the “logical AND” (i.e., the “prod¬ 
uct”) of these probabilities. 


G 

[mag] 

G range 
[mag] 

N 

[10 6 stars] 

w G 

Pg*, min 

P G**,min 

13 

12.5-13.5 

10 

0.0092 

Vo.9999 

V099 

14 

13.5-14.5 

24 

0.0223 

Vo.9999 

VO99 

15 

14.5-15.5 

38 

0.0351 

VO.9999 

V099 

16 

15.5-16.5 

71 

0.0660 

VO.9999 

V099 

17 

16.5-17.5 

125 

0.1167 

VO.9999 

V097 

18 

17.5-18.5 

183 

0.1713 

VO.9999 

VoV7 

19 

18.5-19.5 

377 

0.3526 

VO.9999 

V097 

20 

19.5-20.0 

243 

0.2268 

VO.9999 

V097 


4.2. Regularising the merit function 

With the choice made above to link the weights wg to the fre¬ 
quency of occurrence of stars in the sky, bright stars (G ~ 13- 
lb mag) implicitly receive reduced weight compared to faint 
stars since the latter are (far more) numerous. This is desirable 
to some extent but risks not detecting a disproportionate fraction 
of bright stars, which generally have high scientific importance 
j and small astrometric errors. We therefore introduce regularisa- 
3 \ion factors R * and R ** in the merit function P( u) as defined in 
Equation ( [13] ) enforcing a minimum detection performance for 
single and double stars which varies as function of magnitude: 


20 


($) = P| R g * 


G= 13 


with 



tf Pg* — PG*,min 
otherwise, 


(16) 


32 and similar for double stars (. R **). 

Gaia’s scientific mission requirements entail at least 95% 
on-board observation efficiency for single and double stars over 
the full magnitude range, down to the faint limit G = 20 mag. 
This implies that the detection probability shall be even higher 
than 95% since other losses exist (for example, there is a finite 
confirmation probability in AF1, 0.2% of faint-object transits is 
lost as a result of prioritised allocation of windows to bright 
stars, 0.1% of transits is lost as a result of focal-plane “blind¬ 
ing” caused by nearby bright stars or planets, etc.; Section [24] ). 
Since in early industrial software verification tests >98% detec¬ 
tion performance on single stars has been reached, and since ex¬ 
periments with our software indicate that single-star detection 
percentages of 99.99% can be reached, we adopt threshold val¬ 
ues (Table [l]) P G *, min = V0.9999 and Pg**, min = V0.99 for 
12.5 < G [mag] < 16.5 (bins G = 13,..., 16) and Pg* ; ™ in = 
V0.9999 and P G**,min = V0.97 for 16.5 < G [mag] < 20 (bins 
G = 17,..., 20). The square roots refer to the fact that P defines 
either the high- or the low-frequency detection probability; the 
total detection probability is the “logical AND” (i.e., the “prod¬ 
uct”) of these probabilities. 
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4.3. Optimising the merit function 


To optimise the regularised merit function (P( u) • R * • R ** from 
Equations[T3|and[ T6]), we u se the downhill-simplex minimisation 


method (Nel der & Mead| |1965| |Press et ah) |2007| in practice, 


since we want P to be maximised, we minimise 1 - P( u) • R * 
R **). For both the low- and high-frequency problems, we adopt 
a three-step minimisation approach: 


1. We first explore the full parameter space (-32,768 to 
+32,767 for each parameter) in a coarse manner, using 
randomly-placed starting simplices with large characteris¬ 
tic length scales (10,000) and a reduced set of data (10% 
of all objects, randomly selected from our object/event cat¬ 
alogues). These settings allow to repeat the optimisation a 
large number of times within a reasonable time (e.g., 12 days 
for ~50,000 repeats on a normal workstation), enabling deep 
exploration of the full parameter space. 

2. We then zoom in on the minimum found in the previous step 
and start the optimisation again in that area - still allowing 
the starting simplex to vary from run to run over the char¬ 
acteristic length scale - but now with reduced characteris¬ 
tic length scales (typically -100 for a , b , and c and -1000 
for d and e) and with the full set of objects (750,000 sin¬ 
gle stars, 750,000 double stars generating one local maxi¬ 
mum, 375,000 double stars generating two local maxima, 
750,000 Solar-proton-induced local maxima, and 750,000 
cosmic-ray-induced local maxima). We repeat this minimi¬ 
sation 1,000 times. 

3. We finally restart the optimisation from the minimum found 
in the previous step, but now with further-reduced charac¬ 
teristic length scales (typically by a factor 10 compared to 
the previous step). We repeat this minimisation 100 times. 
The outcome of this step yields the optimised vector u of 
unknowns as well as the achieved detection performance of 
stars and rejection performance of cosmic rays and Solar 
protons. These are discussed further in Section [5] 


5. Results 

After optimisation, the merit function (Equation |T3]) reaches 
P = 14.51 for the low-frequency case, with - by construction 
- regularisation factors R * = R ** = 1, compared to P = 12.04 
for the baseline parameters. In the latter case, however, the min¬ 
imum detection percentages defined in Table [I] are not met, nei¬ 
ther for single nor for double stars, i.e., R * = R ** = 0. All low- 
frequency star-detection probabilities have improved: P* lf went 
from 99.964% to 99.999%, P**^*, LF from 98.417% to 99.867%, 
and P**_>**, lf from 98.308% to 99.961%. At the same time, the 
low-frequency cosmic-ray and solar-proton detections also im¬ 
proved: P cr,lf went from 65.843% to 63.123% and Psp,lf from 
63.560% to 60.587%. For the high-frequency optimisation, we 
reached P = 79.91 (with R * = R ** = 1), compared to P = 80.39 
for the default settings; again, the functional baseline does not 
meet the minimum detection percentages defined in Table [T] nei¬ 
ther for single nor for double stars, i.e., R * = R ** = 0. As for the 
low-frequency case, all high-frequency star-detection probabili¬ 
ties improved: P* ;H f went from 99.997% to 99.998%, P**_>*,hf 
from 99.999% to 100.000%, and P**_>**,hf from 99.963% to 
99.968%; the prompt-particle-event performance slightly de¬ 
graded, from 11.668% to 11.717% for Pcr,hf and from 8.951% 
to 9.453% for Psp,hf- 

After combining the low- and high-frequency results, the fol¬ 
lowing situation emerges: the single-star (faint-star) detection 



G [mag] 


Fig. 12. Single-star detection probability - without any flux 
thresholding - as function of G magnitude for both the 
functional-baseline (red) and the optimised (blue) rejection pa¬ 
rameters. 


probability P* increases from 99.961% to 99.997%; the proba¬ 
bility P**^* to detect a double star as one detection (“unresolved 
double star”) increases from 98.417% to 99.866%; the probabil¬ 
ity P**_>** to detect a double star as two detections (“resolved 
double star”) increases from 98.271% to 99.928%; the probabil¬ 
ity Pcr to detect a cosmic ray decreases from 6.349% to 5.276%; 
and the probability Psp to detect a Solar proton decreases from 
3.401% to 3.064%. The magnitude dependence of these results is 
provided in Table [3j for comparison, Table [2] presents the mag¬ 
nitude dependence of the functional baseline. One can imme¬ 
diately conclude that the functional baseline for the rejection 
parameters provides a starting point which meets the single¬ 
star scientific requirements of the mission (albeit not the more 
stringent minimum detection percentages defined in Table [I]). 
Nonetheless, we have found room for optimisation, the main 
reason being that we have no constraint beyond G ~ 18.5 mag 
to reject cosmic rays and/or Solar protons, simply because such 
events do not e xist in significant quantities (see the discussion 
in Section 3A). So, the flux-dependence freedom of the rejec¬ 
tion curves for faint objects has been used in the optimisation 
to select virtually all detections (local maxima). This is, clearly, 
beneficial for extended objects, in particular unresolved galax¬ 
ies and asteroids (see Sections |T2| and |7.3| ). The price to pay is, 
of course, that also ghosts (Section |3.3| ) are now frequently de¬ 
tected: whereas the functional baseline only lets 1.800% of the 
ghosts through, this increases to 99.866% for the optim ised pa¬ 
rameters. This side effect is discussed further in Section [6~4l 

Figure[l2]shows the single-star detection probability as func¬ 
tion of G magnitude for both the functional-baseline and the op¬ 
timised rejection parameters. These results do not involve a flux 
thresholding: they purely reflect the intrinsic detection perfor¬ 
mance of Gaia, including the effect of the rejection parameters. 
Surprisingly, therefore, the baseline parameters already show the 
start of a downgoing trend in the detection probability of stars 
brighter than the nominal threshold of G = 20 mag. The opti¬ 
mised parameters, on the other hand, show a constant probabil¬ 
ity, close to 100%, up to G = 21 mag (compared with ~40% for 
the functional-baseline parameters reached at G = 21 mag). 

Figures [T3] - [T6] provide two-dimensional contour plots of the 
merit function P( u) and the regularised merit function P(u ) • 
R* • R ** for the various frequency-direction combinations. As 
one can clearly see by the presence of sharp boundaries, the 
regularisation - introduced to maintain the star-detection proba- 
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Fig. 13. Contour plots, for the low-frequency, along-scan case, of all ten parameter combinations. The panels above the diagonal 
refer to the merit function P( u) while the panels below the diagonal refer to the regularised merit function P( u) • R* • R **. The panels 
are centred on the optimised parameter values (intersection of the black lines) and cover a range of 100 for a , b , c, and d and 500 
for e. White areas refer to parameter combinations which violate the minimum detection percentages defined in Table [I] 
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Table 2. Magnitude dependence of object-detection probabili¬ 
ties for the functional-baseline rejection parameters. The sym¬ 
bol indicates the absence of faint cosmic rays and Solar 
protons in o ur prompt-particle-event catalogues, as explained in 
Section |T4| The magnitude-averaged star-detection probabilities 
Pg*-> Pg**^*> and Pg **^** in the last line are weighted with the 
Galaxy-model weights wg from Table I] The functional-baseline 
detection performance is not compatible with the minimum de¬ 
tection percentages defined in Table [I 


G range 
[mag] 

Pg* 

[%] 

Pa**-,* 

[%] 

Pa**—,** 

[%] 

Pa ,cr 

[%1 

Pa, sp 

[%] 

12.5-13.5 

100.000 

99.917 

99.188 

8.584 

15.781 

13.5-14.5 

100.000 

99.929 

99.302 

11.218 

12.091 

14.5-15.5 

100.000 

99.936 

99.263 

4.589 

6.346 

15.5-16.5 

100.000 

99.946 

98.763 

11.957 

1.048 

16.5-17.5 

100.000 

99.886 

99.027 

2.780 

0.454 

17.5-18.5 

100.000 

99.808 

99.121 

7.646 

2.651 

18.5-19.5 

99.999 

99.326 

98.660 

3.871 


19.5-20.0 

99.831 

94.306 

96.200 

- 


12.5-20.0 

99.961 

98.417 

98.271 

6.349 

3.401 


6. Discussion 

6.1. Solar protons 

As already explained in Section [3~~4{ the Solar-proton rate varies 
with time from essentially zero during Solar-quiet times to ex¬ 
tremely high fluxes during Solar flares. In practice, however, 
the Sun behaves bi-modally: it is either “quiet”, i.e., not emit¬ 
ting protons, or “bursting”, i.e., emitting such a high proton flux 
that Gaia’s star trackers are blinded, the spacecraft goes into 
transition mode, and scientific-data collection is suspended. In- 
between states do not really exist, except for the very short, in¬ 
termittent states corresponding to the rise and fall (onset and off¬ 
set) of Solar flares. One might therefore argue that Solar protons 
(i.e., the factor [1 - Psp(u)]) should not be included in the merit 
function, Equation ( [13] ). In practice, however, the inclusion or 
exclusion of protons in the merit function does not significantly 
affect the results of the optimisation since the shape and magni¬ 
tude distributions of protons resembles those of cosmic rays. We, 
somewhat arbitrarily, decided to include a Solar-proton factor in 
the merit function. 


Table 3. As Table § but for the optimised rejection parameters. 


G range 
[mag] 

Pa* 

[%] 

P 

[%] 

P 

[%] 

Pg, cr 

[%] 

Pg, sp 

[%] 

12.5-13.5 

100.000 

99.726 

98.998 

5.722 

12.672 

13.5-14.5 

100.000 

99.713 

99.303 

7.387 

8.692 

14.5-15.5 

100.000 

99.579 

99.641 

3.212 

5.232 

15.5-16.5 

99.997 

99.550 

99.775 

9.645 

1.424 

16.5-17.5 

99.995 

99.505 

99.896 

2.318 

0.889 

17.5-18.5 

99.997 

99.869 

99.978 

6.543 

3.011 

18.5-19.5 

99.999 

100.000 

99.994 

1.864 

- 

19.5-20.0 

99.995 

100.000 

99.994 

- 

- 

12.5-20.0 

99.997 

99.866 

99.928 

5.276 

3.064 


bilities above some minimum-acceptable thresholds (Table [I]) - 
does influence the results. Without regularisation, better figures 
of merit could be obtained but such solutions would sacrifice 
either too many star detections to reduce prompt-particle-event 
detections or too many bright-star detections (G ^ 16 mag) to 
improve faint-star performance (G ^ 18 mag). One can also see 
from the various panels that many parameters are correlated: the 
contour regions are often (strongly) elongated. This is not sur¬ 
prising since the rejection equations have been designed to offer 
coarse and fine adjustment ( [Massartl 2012| ): roughly speaking, 
for a given flux level F, parameters a and b determine the val¬ 
ues of the vertical and horizontal asymptotes, parameter c de¬ 
termines the coarse position of the vertex of the rejection curve 
(Figure [2]), and parameters d and e can be used to fine-tune the 
vertex position. It is hence not surprising to see that the opti¬ 
mum values of c are not too different from the functional base¬ 
line (low-frequency: 170 and 160 versus 155 and 160 for along 
and across scan, respectively; high-frequency: 2447 and 2569 
versus 2667 and 2667 for along and across scan, respectively). 


6.2. Secondary particles 

Whilst shielding a CCD will stop a fraction of the (low-energy) 
prompt-particle events, excessively thick shielding will intro¬ 
duce a flux (“shower”) of secondary particles created by the 
electromagnetic interaction at nuclear level between the primary 
particles (i.e., protons and Helium nuclei) and the shielding. 
However, these secondaries only become significant for shield 
thicknesses in excess of ~ 10 cm of Aluminium (e.g., |Dale et al. 
1993[ ). For Gaia, the effective shielding thickness is at the leve 
of 11 mm of Aluminium, implying that secondary particles are 
(likely) not significant. Nonetheless, an exploratory study for 
Gaia has been made by the Space Environments and Effects 
Analysis Section in the Technical Directorate of the European 
Space Agency (G. Santin, 2009, private communication), as¬ 
suming, as a very worst case, CCD shieldings of 2.0 cm of 
Aluminium plus 1.5 cm of SiC from the back and 3.5 cm of glass 
from the front. Various nuclear collision processes (hadronic 
models, quark gluon strings, and binary cascades) have been 
considered and the results indicate that secondaries will not be 
present in large numbers. Our prompt-particle-event catalogues 
are hence adequate for Gaia. 


6.3. AF1 confirmation 

As explained in Section [24] the confirmation criterion is purely 
flux-based: the “PSF shape” of the confirmed object is not tested. 
In the frame of this investigation, we can thus ignore the AF1 
confirmation stage. Basically, this stage is only useful to pre¬ 
vent Solar protons and cosmic rays which accidentally pass the 
SM detection stage (as well as the pre-selection and resource- 
allocation stages) from being windowed throughout the focal 
plane. This is beneficial only for the volume of science data 
transmitted to ground: prompt-particle events which pass the SM 
filter and are only uncovered in AF1 as spurious detection do al¬ 
ready occupy a window at that stage and this resource allocation 
is irreversible, i.e., this window can be dropped but cannot be 
given back to observe one more star. In other words: the number 
of prompt-particle-event detections shall really be minimised at 
SM-detection level. 
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Fig. 17. Properties of ghosts brighter than G = 20 mag originating from single stars in the range G = 6-20 mag. Panel 1 : across- 
scan rejection plot of ghosts, showing ghosts in the across-scan wings as red symbols and ghosts in the along-scan wings as black 
symbols. Panel 2: as Panel 7, but for the along-scan direction. Panel 3: average number of ghosts that a (parent) star of magnitude G 
generates. This is essentially the same plot as Panel 3 in Figure [7]but for an extended magnitude range of parent stars and displayed 
on a logarithmic scale. The black curve denotes all local maxima associated with ghosts, the blue curve - mostly overlying the black 
curve - denotes those local maxima passing the optimised rejection parameters defined in Section]?] and the red curve denotes those 
local maxima passing the functional-baseline rejection parameters . Th e dashed blue curve represents the alternative set of optimised 
parameters with reduced ghost sensitivity, as discussed in Section [64] 


6.4. The impact of non-rejected ghosts 

Section[5]already concluded that a byproduct of the optimised re¬ 
jection parameters is the increased sensitivity to ghosts: whereas 
the functional baseline only lets 1.800% of the ghosts gener¬ 
ated by parent stars in the range G = 12.5-20 mag through, 
this increases to 99.866% for the optimised parameters. To put 
these numbers in perspective, we extend the analysis presented 
in Section [33] (and Figure [ 7 ]), which is based on parent stars in 
the range G = 12.5-20 mag, to parent stars covering the range 
G = 6-20 mag. Panel 3 in Figure [IT] shows that stars as bright 
as G = 6 mag typically generate more than 250 ghosts on an 
SM-CCD transit; their m agni tude distribution is essentially the 
same as found in Section [ 73 ] strongly peaking at G = 20 mag. 
Panels 1 and 2 show that ghosts occupy very distinct areas in the 
rejection plots: ghosts in the across-scan wings resemble stars in 
the across-scan rejection plot and resemble ripples in the along- 
scan rejection plot. Similarly, ghosts in the along-scan wings re¬ 
semble stars in the along-scan rejection plot and resemble rip¬ 
ples in the across-scan rejection plot. Whereas the functional- 
baseline rejection parameters stop the vast majority of ghosts, 
the optimised rejection parameters do the opposite and let most 
of them through. 

After folding the average number of ghosts that a parent star 
generates with the total number of stars of that magnitude in the 
sky, it is easy to compute the increase in the number of detected 
“sources” caused by ghosts: it is +12.98% for the optimised pa¬ 
rameters, versus +0.22% for the functional baseline. These num¬ 
bers, however, are by far not the full story: 


- As explained in Section |6.3| ghost detections made in SM 
need to pass the (flux-based) confirmation stage in AF1 
before being assigned a window in AF and before being 
telemetered to ground. Since the noise pattern in AF1 will 
differ from that in SM, not all ghost detections will be con¬ 
firmed in AF1. GIBIS simulations suggest, however, that the 
majority of ghosts detections are confirmed in AF1. 

- In addition to increasing telemetry, ghost detections can be 
harmful for the faint-end transit (and catalogue) complete¬ 


ness - in particular in dense areas - since they occupy re¬ 
sources (windows), the number of which in AF is limited 
to W = 20 at each TDI line (see Section [2~4] ). One partic¬ 
ular aspect with the ghosts is that ghost detections in the 
across-scan wings have the same along-scan (TDI) coordi¬ 
nate, and hence compete mutually - as well as with the par¬ 
ent star and with other stars that happen to be present at that 
along-scan position in the same CCD - for the W = 20 re¬ 
sources available per TDI line (in addition, there is the pre¬ 
selection limitation of 5 detecti ons p er TDI line that can enter 
the resource allocation; Section [2^ . Ghost detections in the 
along-scan wings, on the other hand, do typically not mu¬ 
tually compete for resources but “only” compete with other 
stars. Nonetheless, the overall conclusion is that the num¬ 
ber of ghost detections shall preferably be minimised at SM- 
detection level. 

As a result, we performed some experiments to find solutions 
for the rejection parameters which improve upon the functional- 
baseline results for what regards single- and double-star detec¬ 
tions but which do not let through a large percentage of ghosts. 
This proved possible but not without penalty (see Table]?]): the 
ghost-detection probability dropped from 99.866% to 10.841% 
(cf. 1.800% for the functional baseline) while, at the same 
time, the star detection probabilities improved from 99.961% 
to 99.986% for P* (optimised: 99.997%), from 98.417% to 
99.221% for P**_>* (optimised: 99.866%), and from 98.271% 
to 99.119% for P **_>** (optimised: 99.928%); however, this 
performance increase was achieved at the expense of reduced 
prompt-particle-event rejection capabilities: Pcr increased from 
6.403% to 11.186% (optimised: 5.276%) while P S p increased 
from 3.401% to 5.219% from (optimised: 3.064%). 

6.5. Robustness 

One may ask how robust the optimised parameters are to, for 
instance, radiation-damage effects, noise, PSF-shape (changes), 
etc. It is important in this respect to recall the following: 
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Table 4. As Table [ 3 J but for the alternative set of optimised re¬ 
jection parameters which sacrifice object-detection and prompt- 
particle-event rejection perfor man ce to improve the ghost rejec¬ 
tion performance (see Section [6~4]). 


G range 
[mag] 

Pg, 

[%] 

[%] 

P G**—>** 

[%] 

Pg, cr 

[%1 

P G,SP 

[%] 

12.5-13.5 

100.000 

99.951 

99.528 

10.443 

18.943 

13.5-14.5 

100.000 

99.972 

99.679 

13.523 

14.012 

14.5-15.5 

100.000 

99.970 

99.703 

5.231 

6.951 

15.5-16.5 

100.000 

99.982 

99.373 

15.586 

2.139 

16.5-17.5 

100.000 

99.945 

99.537 

9.784 

4.706 

17.5-18.5 

100.000 

99.905 

99.552 

11.339 

5.421 

18.5-19.5 

100.000 

99.664 

99.327 

7.312 

- 

19.5-20.0 

99.939 

97.202 

98.018 

- 

- 

12.5-20.0 

99.986 

99.221 

99.119 

11.186 

5.219 


1. We have deliberately chosen to define one set of rejection 
parameters applicable to all VPUs, despite the available de¬ 
gree of freedom, which could improve the detection perfor¬ 
mance further, to define optimised parameter sets for each 
VPU separately, or even separately for SMI and SM2 within 
each VPU. By design, therefore, the optimised parameters 
cover the large(st possible) variety of wave-front errors and 
PSF shapes, which clearly improves their robustness; 

2 . The figure-of-merit contour plots (Figures |T3|[14| p~5| andp~6|) 
show extended “good regimes” and gradual changes, sug¬ 
gesting that the detection / rejection performance is not sen¬ 
sitive to modest changes in the parameter values, provided 
that the correlations between parameters are considered; 

3. The detection algorithm is not strongly sensitive to 
sky-background or straylight, which manifest themselves 
as a constant background which is eliminated through 
the background-subtraction step. Clearly, enhanced sky- 
background or stray-light levels would induce extra noise. 
However, Figure [12] shows that the optimised parameters 
provide excellent performance not only at G = 20 mag 
(where the signal-to-noise ratio is 21) but at least down to 
G - 21 mag, i.e., when the signal-to-noise ratio has dropped 
to 13. Such a drop would also correspond, for instance, to a 
straylight level of 14 electrons per pixel per second; 

4. The detection algorithm works on integrated fluxes con¬ 
tained in samples, consisting of 2 x 2-binned pixels, of size 
0.12 x 0.36 arcsec. It merely inspects the PSF shape based 
on gross, zeroth-order quantities, namely the flux-vector ele¬ 
ments ho, h\, and h 2 (and vo, Vi, and V 2 ) and the total flux F. 
There is hence no strong sensitivity of the on-board detection 
to fine (milli-arc second-level) features in the PSF; 

5. The robustness of the detection algorithm to CCD degrada¬ 
tion from non-ionising radiation damage (see Footnote 4) 
has been evaluated by Airbus Defence & Space in a ded¬ 
icated laboratory test campaign using a CCD connected to 
the VPAs (Pasquier & Massart, |2012| . These tests have 
demonstrated that (long-term) PSF-shape changes induced 
by radiation damage, even for a worst-case, end-of-mission- 
accumulated radiation dose, are much more subtle and neg¬ 
ligible compared to (short-term) PSF “changes” caused by, 
for instance, across-scan motion, sub-pixel location, etc. 


In short, we believe the optimised parameters are robust against 
(instrumental) effects not included in our assessment. 


6.6. The real Gaia mission 

The study presented in this work shows that, compared to the 
functional-baseline rejection parameters, room for improvement 
exists if a different trade-off is applied: a better detection of 
celestial objects and rejection of cosmic rays and Solar pro¬ 
tons at the expense of more ghosts. The alternative parame¬ 
ters represent an intermediate option. These results will be val¬ 
idated in orbit during the commissioning phase of the mission 
by means of a four-day test in which the functional-baseline 
rejection parameters will be used during 24 hours, followed 
by 24 hours of operations with the optimised parameters from 
Section [5] followed by 24 hours of operations with the alter¬ 
native set of optimised parameters from Section |6.4[ followed 
by 24 hours of operations with a set of rejection parameters 
which effectively do not filter any local maximum. During this 
test, the spacecraft will be operated in ecliptic-pole-scanning 
mode; this guarantees that each telescopes scans each of the 
ecliptic poles on each 6-hour revolution, with only a preces¬ 
sion in ecliptic longitude. The “continuously-observed” ecliptic- 
pole regions, the stellar content of which has been carefully ob¬ 
served from the ground prior to the launch through dedicated ef¬ 
forts by DPAC, constitute the best-available benchmark against 
which the Gaia-detection performance can be assessed, although 
the ground-based observations are limited in/by spatial reso¬ 
lution, bandpass-transformation errors, (unrecognised) variable 
stars, (unresolved) double stars, star-galaxy classification errors, 
etc. In addition, one should keep in mind that the ecliptic poles 
just represent two particular density regimes (low density for 
the north and average density for the south pole) which are not 
representative of dense areas such as the Galactic bulge. Both 
prompt-particle events and ghosts can be harmful, in particular 
in dense areas where all resources are needed by stars, albeit 
prompt-particle events and ghosts differ in the sense that cos¬ 
mic rays and Solar protons are always brighter than G ~ 19 mag 
whereas ghosts are always fainter than ~19 mag: prompt-particle 
events hence compete with bright stars whereas ghosts compete 
with faint stars. In areas with sufficient resources (the vast ma¬ 
jority of the sky in terms of area), such that additional ghosts 
or prompt-particle events can be supported, the preference is 
clearly to have a good star-detection efficiency. Spurious detec¬ 
tions are, and only if confirmed in AF1, “just” a nuisance for the 
data processing in such cases. This, however, is not necessarily 
true in dense areas. Based among others on the outcome of the 
in-orbit test, a decision will be made on which parameter set will 
be flown during nominal operations. The final trade-off and de¬ 
cision, however, is beyond the scope of this work, which mainly 
aims to put the various elements that go into the trade-off onto 
the table. Further calibration and adjustment of the rejection pa¬ 
rameters remains always possible during the mission. 


7. Scientific implications 

After having established optimised rejection parameters, it is in¬ 
teresting to assess their benefit for the science return of Gaia. 
Since it goes without saying that the improved performance of 
the single-star detection, in particular around G - 20 mag, will 
be beneficial to science, we focus on three other areas, namely 
double stars (Sect ion|7.1| ), unresolved galaxies (Section |T2] ), and 
asteroids (Section |7.3| ). 
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Fig. 20. Left-hand-side panels: configuration of rejected double stars leading either to a single local maximum (i.e., rejected un¬ 
resolved double stars; top panel; 4631 objects), or to a local maximum for both components individually (i.e., rejected secondary 
components of resolved double stars; bottom panel; 13, 376 objects) for the functional-baseline rejection parameters. Right-hand- 
side panels: orientation, in along- versus across-scan “coordinates”, of rejected objects leading either to a single local maximum 
(top panel), or to two local maxima (bottom panel). The dots represent the orientation of the secondary components with respect to 
their primary components, which are situated at the origin. The lines denote the CCD pixel grid; recall that Gaia’s detection is based 
on SM samples, composed of 2 x 2-binned pixels. 


7.1. Double stars 

Section lT2l summarises the contents of our double-star data set. 
We differentiate unresolved double stars, which only lead to one 
local maximum in the detection (symbolically ** —> *), from 
resolved double stars, which lead to two local maxima in the 
detection (symbolically ** —> **). Our resolved data set has 
375,000 underlying double-star systems, each with two resolved 
components, whereas the unresolved data set has 750,000 un¬ 
derlying double-star systems with only unresolved components. 
Compared to the AG and p ranges 0-5 mag and 0-354 mas used 


in the optimisation (Sections [4] and [5]), the simulated configura¬ 
tion space of double stars discussed here is limited to primaries 
in the range G = 12.5-21 mag and secondaries with magnitude 
difference AG = 0-8.5 mag, separation p = 0-1061 mas (0- 
18 AL pixels), and orientation a = 0°-360° (a = 0° denotes the 
along-scan axis whereas a = 90° denotes the across-scan axis). 
We only consider detections brighter than G = 20 mag. 

Actually, the double-star data set allows to quantify Gaia’s 
capability to resolve (close) double stars. The spatial distribution 
of the resolved secondary components with respect to their pri- 
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Fig. 21. As Figure 
respectively. 
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but for the optimised rejection parameters. The top and bottom panels contain 1746 and 2140 objects, 


mary companions is composed of a semi-uniform background 
with an ellipsoidal hole centred around the primary. The spa¬ 
tial distribution of the unresolved secondary components with 
respect to their primary companions displays a fully complemen¬ 
tary behaviour and shows a strongly-peaked distribution around 
the origin (“the primary”) which fits into the hole. The hole 
boundary represents the transition between resolved and unre¬ 
solved double stars, refering to Gaia’s detection capability of 
(close) double stars. The axis-ratio of this hole (1 : 3) reflects 
the rectangular pixel size (10 x 30 p m 2 , or 0.06 x 0.18 arcsec 2 ); 
this asymmetric sensitivity is not expected to introduce biases in 
the final Gaia catalogue since objects are typically observed 70 
times over the mission, with “random” scanning orientations. 

Figure [18] shows the probability of a double star to be re¬ 
solved into two local maxima as function of separation p and 


magnitude difference AG when averaged over orientation angle 
a, the idea being that even a few-percent probability means that 
the system will be resolved in at least one of the few dozen tran¬ 
sits acquired during the mission. Gaia’s resolving power does 
degrade with AG but does not vary with the primary’s magni¬ 
tude for a given AG value. Our results are consistent with those 
of |Spagna] ( [2014| ). Figure[l9| shows - for bright, equal-brightness 
double stars (G = 13 mag and AG = 0 mag) - how Gaia’s de¬ 
tection probability depends on orientation and separation. The 
best performance, 0.23 arcsec with 95% confidence level, is ob¬ 
viously found for double stars with a pure along-scan separation. 
For systems with a separation purely in the across-scan direction, 
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Fig. 18. Probability (in %) of a double star with a primary com¬ 
ponent with G = 13 mag to be resolved into two local maxima 
as function of separation p, in units of arcsec, and magnitude 
difference AG (averaged over all orientation angles). This proba¬ 
bility does not take the rejection curves into account; in practice, 
however, the results do not significantly change when applying 
the optimised rejection parameters. Contours at 5%, 25%, 50%, 
75%, 95.5%, 98%, and 100% have been labeled. 


190 


75 


60 


45 


30 


15 


0.8 



0.4 0.5 0.6 

Separation (arcsec) 


0.7 


Fig. 19. Probability of an equal-brightness double star (AG = 
0 mag) to be resolved into two local maxima as function of 
separation, in units of arcsec, and orientation angle, in units 
of degrees. The G magnitude of the primary has been fixed at 
G = 13 mag. This probability does not take the rejection curves 
into account; in practice, however, the results do not signifi¬ 
cantly change when applying the optimised rejection parame¬ 
ters. Contours at 5%, 25%, 50%, 75%, 95%, and 99% have been 
labeled. 

this number is a factor three worse, while for double stars with a 
random orientation, this number is a factor two wors^] 


Figure [20] focuses on the impact of the rejection equations 
on close double stars: the top panels refer to unresolved sys¬ 
tems, i.e., systems which lead to one local maximum, whereas 
the bottom panels refer to resolved systems, i.e., systems which 
lead to two local maxima. Figure [2o] only shows the proper¬ 
ties of the secondary component of those systems which have 
been rejected by the functional-baseline rejection parameters. 
The spatial distribution of the secondary components with re¬ 
spect to their primary companions for the 4631 rejected unre¬ 
solved double stars (top panels) is composed of a uniform back¬ 
ground plus an ellipsoidal ring centred around the primary, of 
radius ~0.2 arcsec (-3 AL pixels) in the along-scan direction 
and -0.6 arcsec (-3 AC pixels) in the across-scan direction, cor¬ 
responding to marginally unresolved systems. The vast major¬ 
ity of rejected unresolved objects have G <: 19.5 mag (see col¬ 
umn Pg**^>* in Table [2]), meaning that their companions have 
AG £ 1 mag (since Gaia’s limit is G = 20 mag). This explains 
the rather sharp cut-off seen in AG-space. The spatial distribution 
of the 13,376 rejected secondary components with respect to 
their primary companions for the resolved systems (bottom pan¬ 
els) also follows a uniform background with an ellipsoidal ring 
superimposed, which is larger and more diffuse than for the un¬ 
resolved double stars. The magnitude distribution of the primary 
components in these systems is rather uniformly spread between 
G = 12.5 and 20 mag (see also column P G **_>** in Table[2]). 

Figure [21] shows the double-star-rejection results for the op¬ 
timised rejection parameters from Section [5] The main struc¬ 
tures from Figure [20] persist but are significantly thinned out, 
i.e., Gaia’s rejection performance of (close) double stars with the 
optimised rejection parameters is significantly better than with 
the functional baseline (4631 rejected unresolved double stars 
reduce to 1746 systems while 13,376 rejected resolved double 
stars reduce to 2140 objects). Striking is the absence of the uni¬ 
form background of rejected unresolved double stars achieved 
with the functional-baseline parameters, i.e., unresolved, faint 
double stars with large(r) separations are correctly classified 
with the optimised rejection parameters. One should, however, 
not over-interpret the importance of detecting both components 
of close double stars since what ultimately matters is whether 
the secondary component is observed or not. This condition can 
be claimed to be met if the secondary is sufficiently-well con¬ 
tained in the 0.4 x 2.1-arcsec 2 window of the primary, so for 
instance when the along-scan separation is less than 0.2 arc¬ 
sec and/or when the across-scan separation is less than 1 arcsec. 
Nonetheless, the observation always improves with a dedicated 
window centred on the secondary, allowing to improve the on¬ 
ground deconvolution of the (across-scan-binned) data. 

7.2. Unresolved galaxies 

To study the impact of the optimised parameters on the de¬ 
tectability of unresolved galaxies, we adopt the GIB IS simula¬ 
tions of exponential and de Vaucouleurs galaxy profiles sum¬ 
marised in Section |3.5| Figure [22] shows the resulting detection 
probabilities of the two extreme-profile cases as function of in¬ 
tegrated brightness and radius. Two well-defined regions with a 
sharp transition can be seen: a region where the profiles are al¬ 
most always detected, in red, and another where the profiles are 
not detected, in blue. Small-scale (noise) variations are due to the 
position of the galaxies in the focal plane and their position in the 
sky: since there are gaps between the CCD rows, and since some 
profiles may even fall outside the focal plane in some transits, 


(2jt)-' ■ £ da ([1 • cos af + [3 • sin a] 2 )‘ /2 = 2 • E(-8 )/tt ; 


2.13, 


where E(m ) is the complete elliptic integral of the second kind. 
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Fig. 22. Galaxy detection probability (in %) for the functional- 
baseline rejection parameters (top panels) and the optimised pa¬ 
rameters (bottom panels). The left-hand-side panels represent 
exponential disk profiles while the right-hand-side panels rep¬ 
resent de Vaucouleurs profiles. Noise structure shared between 
the upper and lower panels is due to the GIB IS simulation, for 
instance gaps between CCD rows. 
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Fig. 23. Detection probability (in %) for exponential disk pro¬ 
files with radius <0.5 arcsec for the functional-baseline rejection 
parameters (left) and the optimised parameters (right). 


the simulated profiles are a priori not expected to be observed in 
all transits and some random variation is expected. 

The detection maps obtained with the functional-baseline re¬ 
jection parameters, in the upper pannels, indicate that exponen¬ 
tial disk profiles are not detected except for the most compact 
ones. On the other hand, most simulated de Vaucouleurs profiles 
are detected, with the most compact ones (with radius <0.5 arc- 
sec) reaching the faint end of the simulated sources. The op¬ 
timised rejection parameters significantly improve the detection 
of exponential disk profiles, as can be seen in the lower-left panel 
of Figure [22] The diagonal transition between the red (bottom- 
left) and blue (upper-right) area in that panel does not mark the 
intrinsic detection performance of Gaia, i.e., the fact that a de¬ 
tection is necessarily linked to the presence of a local maximum 
(see Section [2] and Equation [2] in particular) but rather marks the 


faint-end threshold at G = 20 mag: for large(r) effective radii, 
the galaxy flux is severely underestimated on board as a result of 
an over-subtraction of the background induced by the presence 
of galaxy light in the samples from which the background is es¬ 
timated. As a result, for disk profiles, the region of the parame¬ 
ter space with successful detections is concentrated around the 
most compact or bright exponential profiles. The detection map 
shows an unexpected feature in the region of the brightest sim¬ 
ulated compact profiles, with radius <0.3 arcsec and integrated 
magnitude V < 16 mag: profiles lying therein are rejected as, not 
surprisingly, ripples. In order to assess this effect, we performed 
additional GIB IS simulations, including a s et of simulations with 
no-filter rejection parameters (see Section (h6), for exponential 
profiles with radii varying from 50 to 500 mas. The resulting de¬ 
tection maps in Figure [23] confirm (i) that the region of lower 
detections is a feature introduced by the optimised parameters, 
and (ii) that for radii <100 mas, the profiles are detected again 
for the entire magnitude range, up to the faint end of the simula¬ 
tions. The latter result is expected, as such profiles are almost in¬ 
distinguishable from single stars. The “detection hole” is caused 
by the interplay between the low-frequency, along-scan rejection 
curves (which - for G ^ 18 mag - are more strict for the opti¬ 
mised parameters than for the functional-baseline parameters), 
the size and brightness of the galaxy (which influence the [core] 
size of the image), the on-board background subtraction (which 
operates on the 5 x 5-samples [0.6 x 1.8 arcsec 2 ] ring around the 
sample of interest - see Section [2]), and the on-board magnitude 
estimation (which is severely biased - up to a few magnitudes 
for large(r) effective radii - towards fainter magnitudes). 


The results presented in Figure [22] for de Vaucouleurs pro¬ 
files (right panels) show that the optimised parameters have no 
significant impact on their detectability. As for disk profiles, 
the diagonal transition between the red (bottom-left) and blue 
(upper-right) area marks the faint-end threshold of Gaia. The 
same region of parameter space that was covered using the base¬ 
line rejection parameters is also covered with the optimised ones. 
However, the optimised parameters do improve the detectability 
of intermediate profiles that are between the exponential and the 
de Vaucouleurs profiles. 


Taking into account these results, we expect mostly elliptical 
galaxies and galaxy bulges to be detected by Gaia, and thus to 
be present in the catalogue, while late-type spiral galaxies - even 
those with weak bulges - will be mostly absent. One should keep 
in mind, however, that the window data transmitted to ground 
are of limited extent - typically 4.7 x 2.1 arcsec 2 (along-scan x 
across-scan) in SM and 0.4 x 2.1 arcsec 2 in AF - and of lim¬ 
ited sampling - typically 0.2 x 0.7 arcsec 2 per sample in SM and 
0.06 x 2.1 arcsec 2 per sample in AF - and hence not compara¬ 
ble to classical imaging data. These conclusions are a par with 
those by[de Souza et al. ( |2014| ). Nevertheless, even though few 
real galaxies are expected to populate the region of the param¬ 
eter space opened up by the optimised parameters, the adoption 
of this configuration will enable the exploration of a morpholog¬ 
ical regime that is exclusively available from space observations. 
Even the confirmation of no-detections due to the absence of real 
objects populating this region of the parameter space will pro¬ 
vide important constraints. Moreover, as Gaia is the only space- 
based all-sky survey in the visible wavelength domain and this 
is a regime that has never been explored so systematically and 
extensively as Gaia will be able to do, this prospective is scien¬ 
tifically invaluable and deserves to be pursued. 
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7.3. Asteroids 


To investigate how the adoption of the optimised parameters im¬ 
pacts the detection of asteroids, we use the GIBIS simulations 
of main-belt asteroids (MBAs) and near-Earth objects (NEOs) 
described in Section 3.6 The resulting detection maps for both 
types of objects, after averaging over the ten simulation grids 
and after taking the different projections of the velocity vector 
into account, are presented in Figure [24] They show that, in both 
cases, the optimised rejection parameters provide detection gains 
for objects fainter than 20 mag, in line with Figure[l2] Also, the 
results indicate that for high-velocity NEOs, with speed modulus 
exceeding ~80 mas s -1 , there is a -30% gain. To assess this ef¬ 
fect, we perform additional simulations of 4640 fast NEOs, with 
velocity modulus greater than 90 mas s -1 . The results are rep¬ 
resented in Figure [25] and show a significant detection gain for 
objects at the faint end. On the other hand, these simulations also 
show that fast, bright NEOs have slightly lower detection rates 
when the optimised parameters are adopted. 

The optimised rejection parameters allow a better coverage 
and completeness at the faint end of the sample population, in 
particular for fast-moving NEOs, albeit this gain comes at the 
expense of a -10-20% detection probability loss for bright (V < 
17 mag), fast-moving (>80 mas s -1 ) NEOs. However, the as¬ 
teroid population is far from uniformly distributed in brightness- 
velocity space: bright and fast-moving bolides are much less fre¬ 
quent than faint and slower ones, implying that the gain and loss 
give a net science improvement. The optimised parameters allow 
in particular to find new faint objects (V > 20 mag for MBAs 
and slow NEOs and V > 18 mag for fast NEOs). Unfortunately, 
fast-moving objects, once detected, may not be observed in all 
subsequent astrometric CCDs because the window propagation 
is the same for all detections and based on ’fixed stars’. Science 
alerts in this respect can confirm the detection and bring valu¬ 
able, complementary (astrometric) data. 


8. Conclusions 

We present a study of Gaia’s detection capability of objects, 
in particular (non-saturated) stars, double stars, and unresolved 
galaxies. We have developed an emulation of the on-board detec¬ 
tion software which has been carefully validated against the real 
software. The algorithm has 20 free, so-called rejection parame¬ 
ters governing the boundaries between stars on the one hand and 
point-like or elongated (high-frequency) prompt-particle events 
or extended (low-frequency) ripples on the other hand. We eval¬ 
uate the detection and rejection performance of the algorithm 
using catalogues of simulated single stars, resolved and unre¬ 
solved double stars, Galactic cosmic rays, and Solar protons. 
The functional-baseline rejection parameters allow to detect 
99.961% of single stars, 98.417% of unresolved double stars, 
and 98.271% of resolved double stars (Table [2]). At the same 
time, 6.349% and 3.401% of the cosmic rays and Solar protons, 
respectively, do not get rejected. 

After optimisation, we managed to improve these perfor¬ 
mances to 99.997% for single stars, 99.866% for unresolved 
double stars, 99.928% for resolved double stars, 5.276% for cos¬ 
mic rays, and 3.064% for Solar protons (Table[3]). The optimised 
rejection parameters also remove the artefact of the functional- 
baseline parameters that the reduction of the detection proba¬ 
bility of faint stars as function of G magnitude already sets in 
before the nominal threshold at G = 20 mag. 

We not-surprisingly find, as a result of the rectangular pixel 
size (along-scan : across-scan =1:3), that Gaia’s intrinsic 
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Fig. 24. Detection-probability maps (in %) for asteroids (left: 
MBAs; right: NEOs), using the functional-baseline rejection pa¬ 
rameters (top) and the optimised parameters (middle), as func¬ 
tion of V magnitude and velocity modulus. The bottom panels 
represent the resulting gain (optimised minus functional base¬ 
line). 


power to resolve close double stars - i.e., before the application 
of PSF-shape criteria through the rejection equations and param¬ 
eters in the detection process - is better in the along- than in the 
across-scan direction: the minimum separation to resolve - with 
95% confidence level - a close double star is 0.23 arcsec in the 
along-scan and 0.70 arcsec in the across-scan direction, which 
applies to equal-brightness double stars (AG = 0 mag), regard¬ 
less of the brightness of the primary. To resolve double stars with 
AG > 0 mag, larger separations are required. For a given value 
of AG, Gaia’s resolving power does not vary with the magnitude 
of the primary. 

Whereas the optimised rejection parameters have no sig¬ 
nificant (beneficial) impact on the detectability of pure de 
Vaucouleurs profiles, they do significantly improve the detection 
of pure exponential-disk profiles, and hence also the detection of 
unresolved external galaxies with intermediate profiles. The op¬ 
timised rejection parameters also improve the detection of faint 
asteroids and high-velocity near-Earth objects, albeit at the ex¬ 
pense of a modest detection-probability reduction of fast, bright 
near-Earth objects. 

The only major side effect of the optimised parameters is that 
spurious ghosts in the wings of bright stars also pass unfiltered 
(99.866% detection efficiency, versus 1.800% with the base¬ 
line). We have identified an alternative set of optimised parame¬ 
ters, which sacrifices some object-detection and prompt-particle- 
event rejection performance to reduce the ghost-detection sen- 
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Fig. 25. Detection-probability maps (in %) for NEOs faster than 
90 mas s -1 , using the functional-baseline rejection parameters 
(top, left) and the optimised parameters (top, right), as function 
of V magnitude and velocity modulus. The bottom panel repre¬ 
sents the resulting gain. 


sitivity from 99.866% to 10.841% (Table]?]). An in-orbit test 
during commissioning with the functional-baseline and the op¬ 
timised parameters will provide input for the ultimate decision 
which parameter set to operate the spacecraft with during its op¬ 
erational life. 
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Fig. 14. As Figure [l3] but for the low-frequency, 


across-scan case. 
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Fig. 15. 



As Figure [l3] but for the high-frequency, along 


-scan case. 







































































J.HJ. de Bruijne et al.: Detecting stars, galaxies, and asteroids with Gaia, Online Material p 3 



Fig. 16. As Figure [l3] but for the high-frequency, 


across-scan case. 








































































